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Search Notes 




Dear Examiner Veillard, 

Attached are the search results (from commercial databases) for your case. 

Color tags mark the patents/articles which appear to be most relevant to the case. Pis review 
all documents, since untagged items might also be of interest. If you wish to order the 
complete text of any document, pis submit request(s) directly to the EIC2100 Reference Staff 
located in PK2-4B40. 

Pis call if you have any questions or suggestions for additional terminology, or a different 
approach to searching the case. Finally, pis complete the attached Search Results Feedback 
Form, as the EIC/STIC is continually soliciting examiners' opinion of the search service. 

Thanks, 
Carol 




Requester's Full Name: \fe Itt CTs ..Examiner #•: l7£on Date: 3/2l/o^ 



Art Unit: . 2/75 Phone Number 30 5 ' 7<?7y Serial Number: O^MlS ^^Z 
^ Mail Box and Bldg/Room Location:. ^ ptf Results Format Preferred (circle) :(^APER^DISK E-MAIL 

If more than one search is submitted, please: prioritize searches in order of need. 

**************************************** ***★ ******************* ****** * *.* * ****************** 

Please provide a detailed statement of the search, topic, and describe as specifically as possible the subject<rnatter to be , searched. - 
^Include the elected species. or structures; keywords, synonyms; acronyms, and registry numbers, and combine with the concept or 

utility of the invention. ;-pefine any terms that may have a special meaning. Give examples,or relevant citations, authors, etc, if 
i' 'known. Please attach a copy of the cover sheet, pertinent claims, and abstract. 

, Title of Invention: L/Kj&ulsTlCfrLt-y ft WA &t£ It *j< ^^y-.i file t£*s( f 5ys f* 
Inyentors (please provide full names): Sh ft \T\ \ \\\ ft Op h C\ T 

Earliest Priority Filing Date: % I \ 3 I 2&^J ■ 

*For Sequence Searches Only 1 ' Please include all pertinent information (parent, child, divisional, or issued patent numbers) along with the 
appropriate serial number. A'^" >> '^' . ^ 




******* **^**********************^ 

Type of Search 

NA Sequence, (# ) -4 , STN _ 



STAPF USE ONLY ■ 

i Searcher: - O .. t/v/fWv' 



Searcher Phone #; : <j*\ 



. Searcher Location: '\ 



v-vy 



Vendors and cost where applicable 



, Date Searcher Piclted Up: 
r^te Completed: ~i *^*[ 



Searcher Prep &. Review Time: _ 

;-Glerica1 Prep Time: 

' Online Time: ■ 



<AA Sequence (#)^_ 

Structure.(#) r 

Bibliographic- = 

Litigation . 
Fulltext . _ 
Patent Family 

> . ' y 

Other i 



Dialog-_ 



Questel/Orbit_ 



Dr.Link 
Lexis/Nexis 



Sequence Systems _ 

, WWW/Internet 

Other (specify )' 



PTO-159Q(8-01) 



File EBusiness & Indi^BytR) Jul/1 994-2004 /Mar 25 

(c) 2004 Resp. DB Svcs. 
File 16:Gale Group PROMT (R) 1990-2004 /Mar 26 

(c) 2004 The Gale Group 
File 47:Gale Group Magazine DB(TM) 1 959-2004 /Mar 26 

(e) 2004 The Gale, group 
File 148:Gale Group Trade & Industry DB 197 6-2004 /Mar 26 

(c)2004 The Gale. Group 
File 160: Gale Group PROMT (R) 1972-1989 

(c) 1999 The Gale Group 
File 275:Gale Group Computer DB(TM) 1983-2004 /Mar 26 

(c) 2004 The Gale Group 
File 570:Gale Group MARS (R) 1984-2004 /Mar 26 

(c) 2004 The Gale Group 
File 621:Gale Group New Prod.Annou. (R) 1985-2004/Mar 26 

(c) 2004 The Gale Group 
File 636:Gale Group Newsletter DB(TM) 1987-2004 /Mar 26 

(c) 2004 The Gale Group 
File 649:Gale Group Newswire ASAP(TM) 2004/Mar 25 
(c) 2004 The Gale Group 



Set Items Description 

51 27 62057 CONNECTION? OR CONNECTING? OR CONNECTED OR CONNECT? ? OR I- 

NTERCONNECT? 

52 914281 CITE OR CITES OR CITED OR CITING 

53 4314173 POINT OR POINTS OR POINTED OR POINTER? ? OR POINTING 

54 2266216 LINK? OR HYPERLINK? OR HOTLINK? OR LIVELINK? OR INTERLINK? 

55 4894 RELEVAN? { 1W) (RANK? OR SCORE OR SCORES OR VALUATION? OR RAT- 

ING? OR WEIGHT? ? OR VALUE OR VALUES) 

56 263805 S1:S4{3N) (PAGE OR PAGES OR WEB PAGE? OR WEBSITE? ? OR DOCUM- 

ENT? ? OR ARTICLE? ? OR RECORD? ? OR REPORT? ? OR FILE OR FIL- 
ES) 

57 33955 S1:S4 (3N)OBJECT? ? 

58 145 S5(S)S6:S7 

59 101 S5 (5N) CONTENT 

510 4 S9(S)S6:S7 

511 • 1 RD (unique items) 

512 18 S8/2002:2004 

513 123 S8 NOT (S12 OR S10) 

514 62 RD (unique items) 

515 47 S5(10N) (LINGUIS? OR SYNTAX? OR SYNTACT? OR SEMANTIC?) 

516 0 S5(10N) (SEMIOTIC? OR TAGMEM?) 

517 10 S15/2002:2004 

518 31 S15 NOT (S17 OR S10 OR S8) 

519 18 RD (unique items) 



14/3, K/4 (Item 2 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

08427434 Supplier Number: 71560449 {USE FORMAT 7 FOR FULLTEXT) 
Google, Dis intermediation, and the Palm Beach Ballot. (Government Activity) 

Wiggins, Richard 
Searcher, v9, n3, plO 
March, 2001 

Language: English Record Type: Fulltext 
Document Type: Magazine/ Journal ; Professional 
Word Count: 2203 

not the only search engine to use popularity of a page as part of 
the relevancy ranking algorithm, but it may apply this methodology most 
aggressively. Do a search for a common. . . 

...or "White House" or "Jay Leno" — and Google will much more likely find 
you a page endorsed by the links of the global community of Web users. 
The combination of these two simple design elements... 

14/3, K/5 (Item 3 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

08030957 Supplier Number: 66101423 (USE FORMAT 7 FOR FULLTEXT) 

Inktomi Serves Up Smart Searches — Looking for excellent retrieval 
quality, power and flexibility in a Web site search engine? Inktomi 
Search Software has what you've been searching for. (Software 
Review) (Evaluation) 

Rappoport, Avi 

Network Computing, p54 

Oct 16, 2000 

Language: English Record Type: Fulltext 
Article Type: Evaluation 
Document Type: Magazine/ Journal ; Trade 
Word Count: 4156 

... License). It's a solid search engine for Unix machines. Ht: //Dig's 

robot crawls links on Web pages and the indexer interfaces with 
open-source code to read PDF and Microsoft Word files. The response is fast 
and the relevance ranking reasonable (it will improve in version 3.2, 
under development as of this writing). There... 

14/3 ,K/ 6 (Item 4 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

07025903 Supplier Number: 59281691 (USE FORMAT 7 FOR FULLTEXT) 
Wetware Rules in San Diego: Internet Librarian '99 Conference. (Company 
Operations) 

McDermott, Irene E. 
Searcher, v8, n2, p53 
Feb, 2000 

Language: English Record Type: Fulltext 
Document Type: Magazine/ Journal ; Professional 
Word Count: 37 95 




... almost every time. The folks at Google, Inc. have figured out a way 

to analyze links on Web pages . A site with many links pointing to it 
gets a high relevancy rating . Goggle also considers the importance, 
based on linkage , of the pages that point to the site. And then, of 
course, there's the fact that Google uses AND... 



14/3 ,K/7 (Item 5 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

06938168 Supplier Number: 58545238 (USE FORMAT 7 FOR FULLTEXT) 
The Answer Machine, (information services management) (Industry Trend or 
Event) 

Feldman, Susan 

Searcher: The Magazine for Database Professionals, v8, nl, p58 
Jan, 2000 

Language: English Record Type: Fulltext 
Document Type: Magazine/ Journal; Professional 
Word Count: 12581 

center, with each individual sign pointing in a slightly different 
direction. The words in the document all point to specific directions 
in this imaginary landscape. Documents containing similar words will point 
in the . . . 

...angles to give us a degree of similarity. This "vector space model" can 
help calculate relevance ranking , but it can also determine clusters or 
clumps of similar documents. This is the basis... 

14/3, K/8 (Item 6 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

06849899 Supplier Number: 57902571 (USE FORMAT 7 FOR FULLTEXT) 
Still Haven't Found What You're Looking For? : Search sites aren't there 
yet, but they're learning. (Internet /Web/Online Service Information) 

Bass, Gordon 
PC/Computing, p78 
Jan, 2000 

Language: English Record Type: Fulltext 
Document Type: Magazine/ Journal; General Trade 
Word Count: 589 

Google ranks its results based on a relevancy algorithm that looks 
at how many links point to a given page so a Star Wars fan site that 

repeats the name of the movie ad infinitum. . . 

...www.hotbot.com), a perennial favorite among the search engine 
cognoscenti, takes the idea of relevancy ranking a step further by 
analyzing its success in past searches. Using Direct Hit technology, HotBot 



14/3, K/10 (Item 8 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 



06294979 Supplier Number: 54474837 (USE FORMAT 7 FOR FULLTEXT) 
Rising Relevance in Search Engines. 
Notess, Greg R. 
Online, v23, n3, p84 (1) 
May, 1999 

Language: English Record Type: Fulltext 
Document Type: Magazine/ Journal; Trade 
Word Count: 2 606 

... is no easy task. On some searches, these early Web search engines 

worked successfully, providing links to pages that met or came close to 
meeting the searchers 1 information needs. On other searches, the relevant 
hits were buried deep with low relevance scores . 
STANDARD RELEVANCE 

The precise methods that each search engine uses for determining the 
relevance score... 



? tl4/3, k/12, 15-17, 19,21-22 



14/3, K/12 (Item 10 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

06294975 Supplier Number: 54474833 (USE FORMAT 7 FOR FULLTEXT) 

The future Search Web of Search. 

Sherman, Chris 

Online, v23, n3, p54(l) 

May, 1999 

Language: English Record Type: Fulltext 
Document Type: Magazine/ Journal ; Trade 
Word Count: 4394 

strives to locate authoritative sources ("authorities") on the Web, 
and use the information to compile relevance rankings . Google, a new 
search engine developed at Stanford, is on the forefront of this work. 
"Google measures link importance, " said Larry Page , co-founder of 
Google. A link to another Web page is almost like a citation in a book. 
Web page authors generally only create links to other pages they think 
are important. Web authors have created thousands of links to Yahoo!, for 
example . . . 

...an important site. Yahoo! becomes even more of an important site if lots 
of other pages with high importance point to it. "It's almost like 
having a peer review process for Web pages," according... 

14/3, K/15 (Item 13 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

04548755 Supplier Number: 46685185 (USE FORMAT 7 FOR FULLTEXT) 
PLS Speeds Time to Market for Content-Rich Web Sites With PLWeb Turbo 
Version 2 . 6 

PR Newswire, p0904DCW009 
Sept 4, 1996 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 8 66 

... In response to a query, PLWeb Turbo presents the user with a list 

of hypertext links to "hit" documents listed in order of probable 
relevance to the user, helping users prioritize the search results. The 
relevance - ranking algorithm uses a number of criteria including number 
of hits, proximity of hits in a... 



14/3, K/16 (Item 14 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

04336043 Supplier Number: 46358998 (USE FORMAT 7 FOR FULLTEXT) 
PLWeb Turbo speeds intranetwork site development and ease of use. 

Business Wire, p5030009 
May 3, 1996 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 



Word Count: 660 



Once a search is completed, PLWeb Turbo presents the user with a 
list of hypertext links to "hit" documents — listed in the order of 
probable greatest importance to the user. Relevance ranking saves users 
from wading through long series of useless data. The relevance - ranking 
algorithm users a number of criteria including number of hits, locations of 
hits in a. . . 



14/3, K/17 (Item 15 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

04107314 Supplier Number: 45989957 (USE FORMAT 7 FOR FULLTEXT) 
dtSearch 4.0 and TextBridge 1 OCR by Xerox 1 bundle dtSearch 32-bit version 

for Windows 1 95 and Windows NT dtSearch FindPlus add-on to Norton 

Navigator 

News Release, pN/A 
Dec 6, 1995 

Language: English Record Type: Fulltext 
Document Type: Magazine/ Journal; Trade 
Word Count: 670 

(USE FORMAT 7 FOR FULLTEXT) 
TEXT: 

...of fuzzy, phonic, stemming and wildcard word variations; "plain English" 
natural language search capabilities, including relevancy ranking by 
search term density and rarity; ZIP support; viewing of associated 
graphics; a delete file... 

...numeric range searching. Sharing the same search indexes as Norton 
Navigator, dtSearch 4.0 adds relevancy - ranked natural language searches 
and expands search viewing options, letting users jump from one highlighted 
hit to the next, do second-level text searches, and view hypertext- linked 
search reports . dtSearch 4.0 also adds: ZIP support, imaging, a scrolling 
list of indexed words, and. . . 



14/3,K/19 (Item 1 from file: 47) 

DIALOG (R) File 47: Gale Group Magazine DB (TM) 
(c) 2004 The Gale group. All rts. reserv. 

05981300 SUPPLIER NUMBER: 68875005 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Internet Librarian 2000. 

Hane, Paula J. 

Information Today, 18, 1, 1 

Jan, 2001 

ISSN: 8755-6286 LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 1757 LINE COUNT: 0014 6 

... they're valid research papers. It uses both citation analysis and 

semantic measures for its relevance ranking of retrieved documents. 
Researchlndex provides users with bibliographic information, author 
information, a link to an author's page, an abstract of the document, the 
context of citations ( document cited by and similar documents ) , and 
citations within the document with links to those. For now, the subject 
is just computer science, but the engine gives us... 




14/3, K/21 (Item 3 from file: 47) 

DIALOG (R) File 47: Gale Group Magazine DB(TM) 
(c) 2004 The Gale group. All rts. reserv. 

05513959 SUPPLIER NUMBER: 58064390 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Northern Light Adds Link Popularity to Its Relevance Ranking Factors List. 

Feldman, Susan 

Information Today, 16, 11, 38 
Dec, 1999 

ISSN: 8755-6286 LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 581 LINE COUNT: 0004 8 

Search engines add a weight for each of these sorts of measures 
then calculating the relevance ranking of each document. Documents that 
have more occurrences of the query terms, and more terms... 

...still higher, and so does the semantic and syntactic context of the 
query and the document . 

By adding link popularity, Northern Light has mined the implicit 
reviewing mechanism of other Web authors, who link... 



14/3, K/22 (Item 4 from file: 47) 

DIALOG (R) File 47: Gale Group Magazine DB(TM) 
(c) 2004 The Gale group. All rts. reserv. 

05433224 SUPPLIER NUMBER: 55675900 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Government Search Tools : Evaluating Fee and Free Search 

Alternatives . (National Technical Information Service, Northern Light 
create new search tool, usgovsearch.com) (Company Business and 
Marketing) (Bibliography) 

Gordon-Murnane, Laura 

Searcher, 7, 8, 66 

Sept, 1999 

DOCUMENT TYPE: Bibliography ISSN: 1070-4795 LANGUAGE: English 

RECORD TYPE: Fulltext 

WORD COUNT: 6519 LINE COUNT: 00593 

. . . of those common words. 

Sorting 

Google/Unclesam sorts documents by relevance based on how many pages 
link to it and the "importance" of the linking pages the more pages 
that link to the site, the more important the site, hence, it garners a 
higher relevancy ranking . Also, Google/Unclesam 1 s designers have 
organized pages by site additional hits from the same... 
? tl4/3, k/27, 29,42 

14/3,K/27 (Item 9 from file: 47) 

DIALOG (R) File 47: Gale Group Magazine DB(TM) 
(c) 2004 The Gale group. All rts. reserv. 

04632719 SUPPLIER NUMBER: 16530244 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

It f s 10 o'clock: do you know where your data are? (electronic data loss) 

Cook, Terry 

Technology Review, v98, nl, p48(6) 
Jan, 1995 

ISSN: 0040-1692 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 3310 LINE COUNT: 00264 



only. 



Think of the CEO who sent out her message on investment strategies 
electronically. The interconnections of her compound document are not 
part of what the user sees on the screen, as they would be... 

...or in the operating system. These instruct the computer to query the 
database, drop the relevant values found there into the spreadsheet, 
build a graph using spreadsheet formulas, and place the resulting... 

14/3, K/29 (Item 11 from file: 47) 

DIALOG (R) File 47: Gale Group Magazine DB(TM) 
(c) 2004 The Gale group. All rts. reserv. 

04507409 SUPPLIER NUMBER: 18275748 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Advanced Web searching: tricks of the trade. (World Wide Web) (includes 
related articles on searching) 

Zorn, Peggy; Emanoil, Mary; Marshall, Lucy; Panek, Mary 
Online, v20, n3, pl4(12) 
May-June, 1996 

ISSN: 014 6-5422 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 6056 LINE COUNT: 00482 

... weighted retrieval from the database and returns a user's query 

with hits, sorted by relevance ranking . Rankings are determined by the 
location of the word in the document. A keyword located in... 

...keyword located in the document text. The search results include the 
match score, number of links , document title, headings, sample 
abstract, the URL, and length of document. Lycos boasts an index of... 

14/3, K/42 (Item 8 from file: 148) 

DIALOG (R) File 148: Gale Group Trade & Industry DB 
(c)2004 The Gale Group. All rts. reserv. 

09898145 SUPPLIER NUMBER: 20038605 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Information Access Systems Announces Release of ITMS Web Search Server (TM) 

PR Newswire, pl202LATU097 
Dec 2, 1997 

LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 618 LINE COUNT: 00055 

... in their own words . 

* Search results are returned as a relevance-ranked set of hyper- 
linked pages . The degree of match between each returned document (or 
document segment) and the request is... 
? t!4/3,k/46,49, 53, 55-56 

14/3, K/46 (Item 1 from file: 275) 

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

02422513 SUPPLIER NUMBER: 63601451 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Semio Upgrades Semio Taxonomy 4.0. (Product Announcement) 

Intelligent Enterprise, 3, 10, 65 
June 26, 2000 

DOCUMENT TYPE: Product Announcement LANGUAGE: English 

RECORD TYPE: Fulltext 

WORD COUNT: 267 LINE COUNT: 00026 



and refreshes the document list on the same page as the topic 
categories. The new relevance ranking feature lists search results so 
that documents with the strongest participation in the topic category. . . 

...displayed first. Semio's ranking technology is designed to determine key 
concepts in text-based documents and link those concepts to topic 
categories to develop ranked lists, where the most relevant documents for 



14/3, K/49 (Item 4 from file: 275) 

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

01705690 SUPPLIER NUMBER: 16271014 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Acrobat 2.0: Adobe moves up market, beyond ad hoc document delivery. (Adobe 
Systems portable document software) (Product Announcement) 

Walter, Mark 

Seybold Report on Desktop Publishing, v9, nl, p3(8) 
Sept 12, 1994 

DOCUMENT TYPE: Product Announcement ISSN: 0889-9762 LANGUAGE: 

ENGLISH RECORD TYPE: FULLTEXT 

WORD COUNT: 6752 LINE COUNT: 004 99 

with a qui ck ascii display. 
Proximity. The proximity feature is used to improve the relevancy 
ranking proces s, according to John Dawes, one of the Acrobat product 
managers. When using this... 

...the terms are within a paragraph or a certain number of words . For 
example, in documents of 12- point type, the internal proximity setting 
may be as coarse a s several pages apart, but the relevance ranking 
takes proximity into account, so that documents in which terms are 
clustered are ranked higher. . . 

14/3, K/53 (Item 8 from file: 275) 

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

01253139 SUPPLIER NUMBER: 06888207 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Inside release 3. (Lotus Development Corp's 1-2-3) (includes related 
article on release 3 macros) 

Tucker, Scott 

Lotus, v4, n8, p57(5) 

Aug, 1988 

ISSN: 8756-7334 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 3227 LINE COUNT: 00241 

... on worksheets B, C, and D. 

FILE LINKING AND MULTIPLE FILES 

Another new feature is file linking . Formulas can reference not 
only cells in the active files but also cells in files... 

...contains references to a disk file, Relese 3 searches through the disk 
file and brings relevant values into the active file. If you modify a 
file that has other files linked to it, 1-2-3 updates the linked 
files if they are active (in memory) ; if they are not active, 1-2-3 
updates . . . 



14/3, K/55 (Item 2 from file: 636) 

DIALOG (R) File 636: Gale Group Newsletter DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

03257494 Supplier Number: 46683517 (USE FORMAT 7 FOR FULLTEXT) 
PLS: PLS speeds time to market for content-rich web sites with PLWeb Turbo 
version 2 . 6 

M2 Presswire, pN/A 
Sept. 4, 1996 

Language: English Record Type: Fulltext 
Document Type: News wire; Trade 
Word Count: 847 

... In response to a query, PLWeb Turbo presents the user with a list 

of hypertext links to "hit" documents listed in order of probable 
relevance to the user, helping users prioritize the search results. The 
relevance - ranking algorithm uses a number of criteria including number 
of hits, proximity of bits in a... 



14/3, K/56 (Item 3 from file: 636) 

DIALOG (R) File 636: Gale Group Newsletter DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

03183176 Supplier Number: 46518959 (USE FORMAT 7 FOR FULLTEXT) 
NOVELL: Novell increases networking support expertise on the Internet 

M2 Presswire, pN/A 
July 2, 1996 

Language: English' Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 902 

type and version or by selecting multiple products and versions to 
find more specific information. 

Relevancy ranking determines which documents and information are 
the most likely solution to a query. The search. . . 

...and gives the highest ranking to the document with the most "hits." 
Another consideration in relevancy ranking is how unique a given search 
word is to each document. If the word occurs... 

...names of files contained within compressed files, then sorts the files 
by date, displays the file size, and includes hyperlinks to immediately 
download the files. Users can also browse for files by product and view... 



19/3,K/1 (Item 1 fr^^file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 



08577055 Supplier Number: 74105134 (USE FORMAT 7 FOR FULLTEXT) 

Peters picks & pans . 

Jacso, Peter 

Online, v25, n3, p89 

May, 2001 

Language: English Record Type: Fulltext 
Document Type: Magazine/ Journal ; Professional Trade 
Word Count: 2231 



... professional searching, and just type in familiar terms. 

The difference is much bigger when the linguistic and relevance 

ranking wizardry of KF is applied to a query of multiple words. The query 
anaesthesia malpractice... 



19/3, K/2 (Item 2 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

06424130 Supplier Number: 54942450 (USE FORMAT 7 FOR FULLTEXT) 
Snap.com Embeds Verity Precision Retrieval in Its Fast-Growing Internet 
Portal . 

Business Wire, p0261 
June 21, 1999 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 958 

... Snap.com uses the Verity Developer's Kit (VDK) as its core 

information processing and linguistic analysis solution, conducting Web 
directory searches and providing relevancy rankings over its directory 
of managed site listings. 

Snap.com is currently the fastest-growing Internet... 



19/3 ,K/3 (Item 3 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

05075938 Supplier Number: 47451669 { USE FORMAT 7 FOR FULLTEXT) 
Getting A Jump On The Knowledge -Discovery Market 

Coleman, David 

Computer Reseller News, pl21 

June 9, 1997 

Language: English Record Type: Fulltext 
Document Type: Magazine/ Journal ; Trade 
Word Count: 632 

. . . number of returned responses , 

Verity supports canned topic searches and offers a wide variety of 
linguistically derived topics, ad-hoc queries, and relevance ranking 
through the use of extended or fuzzy Boolean operators. 

Fulcrum Technologies' Surf Server provides several searching... 



19/3, K/5 (Item 5 from file: 16) 

DIALOG (R) File 16: Gale Group PROMT (R) 

(c) 2004 The Gale Group. All rts. reserv. 

04658518 Supplier Number: 46854942 

Testing natural language: comparing DIALOG, TARGET, and DR-LINK. 

Online, p71 
Nov 1, 1996 

Language: English Record Type: Abstract 



Document Type: Magazine/^Pnrnal; Trade 



ABSTRACT: 

...can be entered without commands. DR-LINK is an intelligent text 
retrieval system based on linguistic principles with an English language 
interface, relevance ranking , synonym expansion, concept matching, 
proper noun recognition, disambiguation of terms, and identification of 
noun phrases . . . 



19/3 ,K/ 8 (Item 3 from file: 47) 

DIALOG (R) File 47: Gale Group Magazine DB(TM) 
(c) 2004 The Gale group. All rts. reserv. 

04720186 SUPPLIER NUMBER: 19221702 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Accessing medical information on the Net: time-saving techniques you can 

use for retrieving valuable medical information. (Internet 

resources) (Medical Line) (Column) 

Smith, Stephen E. 

Information Today, vl4, n3, pl2(3) 
March, 1997 

DOCUMENT TYPE: Column ISSN: 8755-6286 LANGUAGE: English 

RECORD TYPE: Fulltext 

WORD COUNT: 1833 LINE COUNT: 0014 7 

yield different results, reflecting their differences in pages 
indexed, words selected for indexing, degree of syntactical evaluation, 
effectiveness of fuzzy logic in relevance ranking , and inclusion or 
exclusion of root words. 

From this come two important timesaving devices in. . . 



19/3, K/9 (Item 4 from file: 47) 

DIALOG (R) File 47: Gale Group Magazine DB{TM) 
(c) 2004 The Gale group. All rts. reserv. 

04641385 SUPPLIER NUMBER: 18848846 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Testing natural language; comparing DIALOG, TARGET, and DR-LINK. 

Feldman, Susan 

Online, v20, n6, p71(8) 

Nov-Dec, 1996 

ISSN: 0146-5422 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 504 8 LINE COUNT: 0038 6 

... LINK, from Manning and Napier Information Services, is an 

intelligent text retrieval system based on linguistic principles. It 
offers an English language query interface, relevance ranking , concept 
matching, synonym expansion, disambiguation of terms, proper noun 
recognition, and identification of noun phrases... 



19/3,K/10 (Item 5 from file: 47) 

DIALOG (R) File 47: Gale Group Magazine DB(TM) 
(c) 2004 The Gale group. All rts. reserv. 

04248469 SUPPLIER NUMBER: 16897847 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Taming the text tiger: cutting-edge search tools beat 
browsing. (Trends) (Brief Article) 

Rupley, Sebastian 

PC Magazine, vl4, nil, p29(l) 

June 13, 1995 

DOCUMENT TYPE: Brief Article ISSN: 0-888-8-507 LANGUAGE: ENGLISH 

RECORD TYPE: FULLTEXT 

WORD COUNT: 437 LINE COUNT: 00044 

... of Folio Corp., is more skeptical that the PC platform is ready for 

cutting-edge relevancy ranking : " Linguistic and morphological 
analysis are incredible technologies, but only a year ago ConText was 



taking a. 



19/3, K/ll (Item 6 from file: 47) 

DIALOG (R) File 47: Gale Group Magazine DB(TM) 
(c) 2004 The Gale group. All rts. reserv. 

04128687 SUPPLIER NUMBER: 16098240 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Intelligent agents; software servants for an electronic information world 

(and more!) . (autonomous and adaptive computer programs operating within 

software environments) (includes two related articles) 

Roesler, Marina; Hawkins, Donald T. 
Online, vl8, n4, pl8 (11) 
July, 1994 

ISSN: 0146-5422 LANGUAGE: ENGLISH RECORD TYPE : FULLTEXT; ABSTRACT 

WORD COUNT: 6136 LINE COUNT: 00511 

the components used are a natural language query interface, 
statistical processing using vector methods, a semantic network of words 
and word relationships, relevance ranking , query by example, 
linguistic rules and analyses, and Boolean logic. 



19/3,K/13 (Item 1 from file: 148) 

DIALOG (R) File 148: Gale Group Trade & Industry DB 
(c)2004 The Gale Group. All rts. reserv. 

07675341 SUPPLIER NUMBER: 16431093 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Whatever happened to BRS software? (Dataware Technologies Inc. chief 
executive officer Kurt Mueller) (Interview) 

Quint, Barbara 
Searcher, v3, nl, p42(3) 
Jan, 1995 

DOCUMENT TYPE: Interview ISSN: 1070-4795 LANGUAGE: ENGLISH 

RECORD TYPE: FULLTEXT 

WORD COUNT: 1931 LINE COUNT: 00152 

. . . search engine, including BRS/SEARCH and CD Answer. Some of the 

components are natural language, relevance ranking , automatic thesauri 
or synonym lists, semantic networks, statistical thesauri, document 
clustering, document summarization, query by example and automated 
indexing. We demonstrated. . . 

... in Europe to rave reviews . 

The first out will be the natural language and enhanced relevance 
ranking , semantic network, and thesaurus functionality modules. For 
thesauri we will offer content in up to 11... 



19/3, K/14 (Item 2 from file: 148) 

DIALOG (R) File 148: Gale Group Trade & Industry DB 
(c)2004 The Gale Group. All rts. reserv. 

07576484 SUPPLIER NUMBER: 15875813 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Searching natural language systems: searchers know thy engine, (includes 
related article on new natural language search engines) 

Feldman, Susan E. 
Searcher, v2, n8, p34(5) 
Oct, 1994 

ISSN: 1070-4795 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT 

WORD COUNT: 4390 LINE COUNT: 00349 

. . . term which covers all of the new breeds of search engines, whether 

they rely on relevance ranking alone or on, additional linguistic 
-based processes. However, systems like Westlaw's WIN or Mead Data 
Central's Freestyle, or . . . one 1 s own system. 
SMART 

At 33 years, the SMART system is probably the oldest relevance 



ranking system. Salton '^^search in automatic text retr^^al and 

computational linguistics began in the 1960s, and formed the basis for 
much of the current work. The... 



19/3, K/15 (Item 1 from file: 275) 

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 



02124507 SUPPLIER NUMBER: 19810435 

Looking for data in all the wrong places, 
sources) (Technology Information) 

Brooks, Peter L. 

DBMS, vlO, nil, p70(6) 

Oct, 1997 

ISSN: 1041-5173 LANGUAGE: English 

WORD COUNT: 4 596 LINE COUNT: 0038 8 



(USE FORMAT 7 OR 9 FOR FULL TEXT) 
(finding data in free -form data 



RECORD TYPE: Fulltext; Abstract 



universal database searching. 
Products using these technologies typically include the use of a 
dictionary, thesaurus, semantic rules concept searching, high-performance 
indexing, and relevance ranking functions that SQL simply does not 
provide. Nondatabase sources, such as Web pages and Microsoft... 



19/3, K/16 (Item 2 from file: 275) 

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

01938374 SUPPLIER NUMBER: 18296944 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Fulcrum's SearchServer family: standing out by fitting in. (includes 

related articles about Fulcrum's products and pricing, and partners and 

dealers) (Product Information) 

Banet, Bernard 

Seybold Report on Desktop Publishing, vlO, n9, pl4(8) 
May 20, 1996 

ISSN: 0889-9762 LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 5532 LINE COUNT: 00456 

techniques such as inverse document frequency to assign higher 
weight to rare terms in calculating relevance rankings . Linguistic 
awareness of the engine broadens searches via word stemming and a 
thesaurus. In SearchServer 3... 



19/3, K/17 (Item 3 from file: 275) 

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

01805565 SUPPLIER NUMBER: 16280777 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

IFRA 1 94 recap: Europe adopts Windows, new systems abound, retrieving 

images, (includes related articles on Partner von dem Druck ' s innovative 
drum scanner and an acknowledgment to Apple for the use of its QuickTake 
digital camera) 

Joner, Urban; Karsh, Arlene E.; Neeff, David; Tribute, Andrew 
Seybold Report on Publishing Systems, v24, n5, p3(55) 
Nov 17, 1994 

ISSN: 0736-7260 LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 4 4 978 LINE COUNT: 03568 

include : 

* User- tunable fuzzy logic. 

* Full-text searches with Boolean or non-Boolean logic and relevance 
ranking . 

* Searches on keywords in text and fielded data. 

* Linguistic -phonetic capabilities. 

* Use of truncated words. 

* Thesaurus — one or more active at one time per... 



19/3, K/18 (Item 1 from file: 636) 

DIALOG (R) File 636: Gale Group Newsletter DB(TM) 
(c) 2004 The Gale Group. All rts. reserv. 

03522703 Supplier Number: 47275450 (USE FORMAT 7 FOR FULLTEXT) 
FULCRUM TECHNOLOGIES: Fulcrum announces Java Developers' Toolkit for rich, 
web -based search apps 

M2 Presswire, pN/A 
April 7, 1997 

Language: English Record Type: Fulltext 
Document Type: News wire; Trade 
Word Count: 945 

phrase, proximity and wildcard searching - along with more advanced 
functionality, such as search-term highlighting, relevance ranking , 
Intuitive Searching (Fulcrum's exclusive similarity searching feature), 
linguistic expansion, natural language searching, and an international 
thesaurus that supports all major European languages. 
* Inherent . . . 

9 
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15/7/2 (Item 2 from file: 2) 



DIALOG (R) File 2 : INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

6265750 INSPEC Abstract Number: C1999-07-7250N-012 
Title: Collaborating in information space 
Author (s): Alsmeyer, D.; Owston, F. 

Author Affiliation: British Telecom Res. Labs., Ipswich, UK 
Conference Title: Online Information 98. Proceedings p. 31-7 
Editor(s): McKenna, B.; Graham, C; Kerr, J. 
Publisher: Learned Information Europe Ltd, Oxford, UK 
Publication Date: 1998 Country of Publication: UK xix+398 pp. 
ISBN: 1 900871 31 9 Material Identity Number: XX-1998-03642 

Conference Title: Proceedings of Online Information 98 
Conference Sponsor: Information World Review; The Times 
Conference Date: 8-10 Dec. 1998 Conference Location: London, UK 
Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: The paper describes the development of subject based 
information spaces at the BT Labs Digital Library. These information spaces 
are built and maintained both by library professionals and by end users. 
The spaces form a browsable and searchable information source on topics of 
interest to library users. Any library user can create an information space 
for their own use. If they wish, they can publish their space so that other 
users can share this resource. The spaces improve information retrieval and 
navigation because they are subject focused. At present, information spaces 
are fed with content from traditional databases, such as INSPEC. In the 
future, they will be enhanced with information from other sources, such as 
users' bookmark files and the results of Internet searches. Basic 
relevance ranked retrieval has been supplemented by building knowledge 
of the databases' metadata into the search tool. Users are encouraged by 
the interface to browse through the spaces by keyword, classification code 
and other features rather than rely on simple keyword searching. Database 
records are enhanced with links through to the full text of journal 
articles, where these are available, or to the library's disintermediated 
document delivery system where an online version of the article is not 
available. (12 Refs) 

Subfile: C 

Copyright 1999, IEE 



15/7/3 (Item 3 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

5242613 INSPEC Abstract Number: C9605-7250-010 

Title: Search and ranking algorithms for locating resources on the World 
Wide Web 

Author(s): Yuwono, B.; Lee, D.L. 

Author Affiliation: Dept. of Comput . & Inf. Sci., Ohio State Univ., 
Columbus, OH, USA 

Conference Title: Proceedings of the Twelfth International Conference on 
Data Engineering (Cat. No . 96CB35888 ) p. 164-71 
Editor (s) : Su, S. Y.W. 

Publisher: IEEE Comput. Soc. Press, Los Alamitos, CA, USA 
Publication Date: 1996 Country of Publication: USA xx+678 pp. 
ISBN: 0 8186 7240 4 Material Identity Number: XX96-00892 

U.S. Copyright Clearance Center Code: 1063-6382/96/$5 . 00 

Conference Title: Proceedings of the Twelfth International Conference on 
Data Engineering 

Conference Sponsor: IEEE Comput. Soc. Tech. Committee on Data Eng 
Conference Date: 26 Feb.-l March 1996 Conference Location: New 



Orleans, LA, USA 

Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: Applying information retrieval techniques to the World Wide Web 
(WWW) environment is a challenge, mostly because of its 
hypertext/hypermedia nature and the richness of the meta-inf ormation it 
provides. We present four keyword-based search and ranking algorithms for 
locating relevant WWW pages with respect to user queries. The first 
algorithm, Boolean Spreading Activation, extends the notion of word 
occurrence in the Boolean retrieval model by propagating the occurrence of 
a query word in a page to other pages linked to it. The second 
algorithm, Most-cited, uses the number of citing hyperlinks between 
potentially relevant WWW pages to increase the relevance scores of the 
referenced pages over the referencing pages. The third algorithm, TFxIDF 
vector space model, is based on word distribution statistics. The last 
algorithm, Vector Spreading Activation, combines TFxIDF with the spreading 
activation model. We conducted an experiment to evaluate the retrieval 
effectiveness of these algorithms. From the results of the experiment, we 
draw conclusions regarding the nature of the WWW environment with respect 
to document ranking strategies. (14 Refs) 

Subfile: C 

Copyright 1996, IEE 



15/7/4 (Item 4 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

02944781 INSPEC Abstract Number: C87049463 
Title: Some considerations for approximate optimal queries 

Author (s) : Kwok, K.L. 

Author Affiliation: Queens Coll., City Univ. of New York, Flushing, NY, 
USA 

Conference Title: Proceedings of the Tenth Annual International ACMSIGIR 
Conference on Research and Development in Information Retrieval p. 19-24 
Editor(s): Yu, C.T.; Van Rijsbergen, C.J. 
Publisher: ACM Press, New York, NY, USA 

Publication Date: 1987 Country of Publication: USA vii+317 pp. 
ISBN: 0 89791 232 2 

U.S. Copyright Clearance Center Code: 089791 232 2/87/0006/0019-$00 . 75 
Conference Date: 3-5 June 1987 Conference Location: New Orleans, LA, 
USA 

Language: English Document Type: Conference Paper (PA) 
Treatment: Theoretical (T) 

Abstract: An optimal query has been defined as one which will recover all 
the known relevant documents of a query in their best probability of 
relevance ranking . The author has slightly modified the definition so 

that it also allows one to trace its evolution from the original to the 
optimal via the various feedback stages. Such a query can be constructed by 
modifying the original query with terms from the known relevant documents 
. It is pointed out that such a term addition strategy differs materially 
from other approaches that add terms based on term association with all 
query terms, and calculated from the whole document collection. The effect 
of viewing a document as constituted of components, and hence affecting, the 
weighting and retrieval results of the optimal query, is also discussed. ( 
19 Refs) 
Subfile: C 



15/7/5 (Item 5 from file: 2) 

DIALOG (R) File 2 : INSPEC 



(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

02647107 INSPEC Abstract Number: A86054063, C86025429 
Title: Review journals in physics 
Author(s): Todorov, R. 

Author Affiliation: Centre for Sci. Inf., Bulgarian Acad, of Sci . , Sofia, 
Bulgaria 

Journal: Czechoslovak Journal of Physics, Section B vol.B36, no.l 
p. 157-62 

Publication Date: 1986 Country of Publication: Czechoslovakia 
CODEN: CZYPAO ISSN: 0011-4626 

Language: English Document Type: Journal Paper (JP) 
Treatment: General, Review (G) 

Abstract: Some traditional characteristics of the most important physics 
review journals, such as the number of published articles, subject scope, 
national origin of articles, apparent publication delay, number of pages 
and references are presented. A new measure is used to classify the review 
articles: the number of subspecialties (research fronts) covered and the 
corresponding number of core documents ( relevance weight ) cited . 
These numbers seem to characterize the topicality and interpretativeness of 
review articles. (22 Refs) 

Subfile: A C 



15/7/6 (Item 6 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

00211355 INSPEC Abstract Number: C71001582 
Title: A highly associative document retrieval system 
Author (s): Cagan, C. 

Author Affiliation: Washington State Univ., Pullman, WA, USA 
Journal: Journal of the American Society for Information Sciences 
vol.21, no. 5 p. 330-7 

Publication Date: Sept. 1970 Country of Publication: USA 
CODEN: AISJB6 ISSN: 0002-8231 

Language: English Document Type: Journal Paper (JP) 

Abstract: This paper describes a document retrieval system implemented 
with a subset of the medical literature. With the exception of the 
development of a negative dictionary, all system operations are completely 
automatic. Introduced are methods for computation of term-term association 
factors, indexing, assignment of term-document relevance values , and 
computations for recall and relevance . High weights are provided for 
low-frequency terms, and retrieval is performed directly from highly 
connected term- document files without elaboration. Recall and 

relevance are based on quantitative internal system computations, and 
results are compared with user evaluations. (21 Refs) 
Subfile: C 



15/7/9 (Item 1 from file: 35) 

DIALOG (R) File 35 : Dissertation Abs Online 

(c) 2004 ProQuest Inf o&Learning . All rts. reserv. 

1006869 ORDER NO: AAD86-27852 

COMPARING THE RETRIEVAL EFFECTIVENESS OF FREE-TEXT AND CITATION SEARCH 
STRATEGIES IN THE SUBJECT OF TECHNOLOGY PLANNING (BIBLIOGRAPHY, ONLINE 
DATABASES) 

Author: VIDAL-ARBONA, CARLOS 
Degree: PH.D 



Year: 1986 

Corporate Source/Institution: CASE WESTERN RESERVE UNIVERSITY (0042) 
Source: VOLUME 47/08-A OF DISSERTATION ABSTRACTS INTERNATIONAL. 
PAGE 2778. 167 PAGES 

This comparative study addresses the retrieval effectiveness of two 
methods of online searching: subject term and citation searching. Both 
methods were used to search the subject of Technology Planning in a 
collection of articles published by 223 journals. 

The objective of the experiment was to determine: (1) how much overlap 
existed between the articles retrieved by each of the methods; (2) under 
what circumstances each method yielded more relevant articles; (3) if a 
method which consistently retrieves a high percentage of relevant articles 
can be developed. 

Searches were conducted for eight questions on the Dialog Information 
Service against two data bases: (1) AB I /INFORM — a management literature 
file containing bibliographic citations and article summaries; (2) Social 
SciSEARCH — a citation index to the literature of the Social Sciences. 

Each of the questions was searched twice: one via subject terms on 
ABI/INFORM, and the second time via citations on Social Scisearch. 
Practitioners in Technology Planning evaluated the search results for 
relevance . Effectiveness scores were calculated based on relevance 
judgements performed by experts on how similar the retrieved results were 
to two known relevant "seed" articles they submitted for the experiment. 

The study results demonstrated that subject term searches satisfied 
the information requirement of higher recall with reasonable precision 
preferred by the experts who participated in the investigation. Citation 
was found to be an effective method when "seed" documents cited 
significant prior work. Additionally, it was shown that the number of 
common documents retrieved by both methods is very small. And that both 
methods are complementary. This finding is consistent with prior research 
and implies that subject bibliographies assembled by employing one method 
alone miss a large proportion of relevant articles. 



15/7/10 (Item 1 from file: 144) 

DIALOG (R) File 14 4: Pascal 

(c) 2004 INIST/CNRS. All rts. reserv. 

14104888 PASCAL No.: 99-0299011 

Linguistic searching versus relevance ranking : DR-link and target 

JONES K 

University of Tennessee, School of Information Sciences, 804 Volunteer 
Blvd., Knoxville, TN 37996, United States 

Journal: Online & CDROM review, 1999, 23 (2) 67-80 

ISSN: 1353-2642 Availability: INIST-17093; 354 000084506610010 

No. of Refs. : 11 ref . 

Document Type: P (Serial) ; A (Analytic) 
Country of Publication: United Kingdom 
Language: English 

Limitations of traditional Boolean searching are claimed to be overcome 
by two alternative search systems: DR-LINK, a linguistic search system, and 
TARGET, a relevance ranking system. This paper compares the system and 
search features of both and describes conceptual differences in system 
design. A series of test questions was developed to test the retrieval 
effectiveness of both search systems. A controlled dataset was used to 
measure the results. System features are compared and discussed. Relevance 
overlap and search capabilities are evaluated and results are presented. 



Copyright (c) 1999 INIST-CNRS. All rights reserved. 



15/7/11 (Item 1 from file: 202) 

DIALOG (R) File 202: Info. Sci. & Tech. Abs . 
(c) 2004 EBSCO Publishing. All rts. reserv. 

3603344 
Teoma . com . 

Author (s): Bates, Mary Ellen 

Author URL: http://www.batesinfo.com 

Corporate Source: Bates Information Services, Washington, DC 
Online vol. 25, no. 5, pages 96 
Publication Date: September/October 2001 
ISSN: 0146-5422 

Journal URL: http://www.onlineinc.com/onlinemag 

Publisher URL: http://www.onlineinc.com 

Language: English 

Document Type: Journal Article 

Record Type: Abstract 

Journal Announcement: 3608 

Presents a highly favorable review of the startup search engine Teoma 
<www.teoma.com>, which works "like a champ," with the look and feel of 
Google.com. Teoma Technologies is a new company funded by Hawk Holdings and 
uses a technology developed at Rutgers University, which has "parametric 
mathematical algorithms designed to tailor a search to locate the most 
pertinent information on the Internet." What it does is rank search results 
based on a link analysis of just the retrieved Websites, which presumably 
is a more accurate gauge of relevance than ranking search results by a 
calculation of how frequently they are linked to by all Websites (as is 
done by Google). Explains how Teoma.com works in practice, and describes 
some of its many features and benefits. 



15/7/13 (Item 3 from file: 202) 

DIALOG (R) File 202: Info. Sci. & Tech. Abs. 
(c) 2004 EBSCO Publishing. All rts. reserv. 

0502747 

A highly associative document retrieval system. 

Author(s): Cagan, Carl 

Corporate Source: Washington State University, Pullman. 

Journal of the American Society for Information Science vol. 21, no. 5 
, pages 330-337 

Publication Date: September-October 1970 

ISSN: 0002-8231 

Language: English 

Document Type: Journal Article 

Record Type: Abstract 

Journal Announcement: 0500 

This paper describes a document retrieval system implemented with a 
subset of the medical literature. With the exception of the development of 
a negative dictionary, all system operations are completely automatic. 
Introduced are methods for computation of term-term association factors, 
indexing, assignment of term-document relevance values , and 
computations for recall and relevance . High weights are provided for 
low-frequency terms, and retrieval is performed directly from highly 
connected term- document files without elaboration. Recall and 
relevance are based on quantitative internal system computations, and 
results are compared with user evaluations. 
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17/3, K/2 (Item 2 frdlrfile: 15) 

DIALOG (R) File 15 : ABI /Inform (R) 

(c) 2004 ProQuest Inf o&Learning . All rts. reserv. 



02358307 117541737 

Jamaica: a World Wide Web profiler 

Ho, Chee-Wai; Goh, Angela 

Internet Research v9n2 PP: 129-139 1999 
ISSN: 1066-2243 JRNL CODE: NTRS 
WORD COUNT: 4 597 



...TEXT: by that search engine, 
shown under each RS { Relevance 
documents whose relevance coincided 
we . . . 



With reference to Table IV, the number 
Score ) represents the number of 
with the scale. For sampling purposes, 



. . . where A represents the number of erroneous links and B represents the 
number of duplicated documents / links . To ensure unbiased scores, 
results returned from search engines were graded subjectively with no 
preferences . . . 

... graded ROS[sub]i taking into account its ranking position. ROS[sub]i is 
the relevance ordering score for the i[sup]th document as well as its 
ranked position within the Max. . . 



17/3, K/3 (Item 3 from file: 15) 

DIALOG (R) File 15 : ABI/Inf orm (R) 

(c) 2004 ProQuest Inf o&Learning . All rts. reserv. 

02325917 86925450 

The evolution of Web searching 

Green, David 

Online Information Review v24n2 PP: 124-137 2000 
ISSN: 1468-4527 JRNL CODE: ONCD 
WORD COUNT: 8 4 87 

...TEXT: similar guidelines, with the location and frequency of words the 
primary determining factors in results relevance ranking . However, 
during 1998, a number of new search engine providers appeared. These' 
companies built their... 

. . . the "popularity" of a Web site, the Google index is based on the number 
of links between pages and sites, whilst the Real Names index is a 
pay-for service that enables companies. . .processing the text around each 
hyperlink (Green, 1999a) . 

Links-based analysis does feature in the relevance ranking algorithms 
of some search engine providers such as Excite and HotBot. However, Google 
is the . . . 

. . . wide searching. The company estimates that its index is between 70 
million and 100 million pages , but, through the links analysis, enables 
users to reach an estimated 300 million Web pages. Google's combination of 



17/3, K/4 (Item 4 from file: 15) 

DIALOG (R) File 15 : ABI/Inf orm (R) 

(c) 2004 ProQuest Inf o&Learning . All rts. reserv. 

02119816 -67261260 
Internet Librarian 2000 

Hane, Paula J 

Information Today vl8nl PP: 1, 46 Jan 2001 
ISSN: 8755-6286 JRNL CODE: I FT 
WORD COUNT: 164 0 



...TEXT: they're valid ^^earch papers. It uses both cit^Mbn analysis and 

.Se.mani.ijc — meas.ur.e s for ^s relevance ranking o f^"Te^ff e ved~ doTcument's-r- 
Researchlndex provides users with bibliographic information, author 
information, a link to an author's page, an abstract of the document, the 
context of citations ( document cited by and similar documents ) , and 
citations within the document with links to those. For now, the subject 
is just computer science, but the engine gives us... 



17/3, K/6 (Item 6 from file: 15) 

DIALOG (R) File 15 : ABI/Inf orm (R) 

(c) 2004 ProQuest Inf o&Learning . All rts. reserv. 
01965824 47033808 

Northern Light adds link popularity to its relevance ranking factors list 

Feldman, Susan 

Information Today vl6nll PP: 36 Dec 1999 
ISSN: 8755-6286 JRNL CODE: I FT 
WORD COUNT: 554 

...TEXT: Search engines add a weight for each of these sorts of measures 
when calculating the relevance ranking of each document. Documents that 
have more occurrences of the query terms, and more terms... 

. . . still higher, and so does the semantic and syntactic context of the 
query and the document . 

By adding link popularity, Northern Light has mined the implicit 
reviewing mechanism of other Web authors, who link. . . 



17/3, K/7 (Item 7 from file: 15) 

DIALOG (R) File 15 : ABI/Inf orm (R) 

(c) 2004 ProQuest Inf o&Learning . All rts. reserv. 
01835558 04-86549 

High AJeevers: Valet-added searching from Ask Jeevers 

Basch, Reva 

Database v22n3 PP: 28-34 Jun/Jul 1999 
ISSN: 0162-4105 JRNL CODE: DTB 
WORD COUNT: 2 932 

...TEXT: of Web search tools, in which the deliverable is more than just a 
raw, supposedly relevance - ranked accepts a natural language query and 
attempts to match it against a list of known... 

...as a metasearch engine, too). Each question is associated with an answer 
template consisting of links to specific Web pages that Jeeves' 
editorial staff more about them shortly-have scouted and selected. Ask 
Jeeves can . . . 



17/3, K/10 (Item 10 from file: 15) 

DIALOG (R) File 15 : ABI/Inf orm (R) 

(c) 2004 ProQuest Inf o&Learning . All rts. reserv. 

01812094 04-63085 

The future of Web search 

Sherman, Chris 

Online v23n3 PP: 54-61 May/Jun 1999 
ISSN: 0146-5422 JRNL CODE: ONL 
WORD COUNT: 4 411 

...TEXT: strives to locate authoritative sources ("authorities") on the 
Web, and use the information to compile relevance rankings . Google, a 
new search engine developed at Stanford, is on the forefront of this work. 
"Google measures link importance, " said Larry Page , co-founder of 
Google. A link to another Web page is almost like a citation in a book. 



Web page authors gener^^^ only create links to other ^Bges they think 
are important. Web autWs have created thousands of li^ra to Yahoo!, for 
example. . . 



...an important site. Yahoo! becomes even more of an important site if lots 
of other pages with high importance point to it. "It's almost like 
having a peer review process for Web pages," according... 



17/3, K/13 (Item 13 from file: 15) 

DIALOG (R) File 15 : ABI/Inf orm (R) 

(c) 2004 ProQuest Inf o&Learning . All rts. reserv. 
01448847 00-99834 

Internet search techniques and strategies 

Notess, Greg R 

Online v21n4 PP: 63-66 Jul/Aug 1997 
ISSN: 0146-5422 JRNL CODE: ONL 
WORD COUNT: 2 927 

...TEXT: the strangest page may provide a link to a reliable and 
authoritative information source. 

The relevancy ranking algorithms employed by all the major Internet 

databases are intended to help bring the most... 

... relevant show up first. Unfortunately, with the diversity and variable 
quality of Internet resources, the relevancy ranking algorithms often 
fail to lift the most relevant hits to the top. Try a multiple... 

...look for the best of the top ten or so hits, then see if those pages 
link to more relevant sites. Sometimes it takes going four or five levels 
deep to find. . . 
?t!7/3, k/25-28 

17/3, K/25 (Item 4 from file: 484) 

DIALOG (R) File 4 84 : Periodical Abs Plustext 
(c) 2004 ProQuest. All rts. reserv. 

02314451 (USE FORMAT 7 OR 9 FOR FULLTEXT) 

SciSearch on STN: Unique features for sophisticated searching 

Huber, Charles F 

Database (DTB) , vl8 n2, p52-62, p. 9 
Apr 1995 

ISSN: 0162-4105 JOURNAL CODE: DTB 

DOCUMENT TYPE: Feature 

LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 3379 LENGTH: Long (31+ col inches) 

TEXT: 

for SORT OCC. Since the cited reference field (RE) is 
a highlighted field, this allows relevancy ranking of articles 
based on the number of cited references hit. Combined with other features, 
this. . . 

...pulls the first author's name, publication year, volume and first page 
information from a record and creates a cited reference search term out 
of them, allowing quick determination of which records cite a given 
record or records found in, for instance, a subject or author search. 
Most exciting of all... 



17/3, K/26 (Item 1 from file: 813) 

DIALOG (R) File 813: PR Newswire 

(c) 1999 PR Newswire Association Inc. All rts. reserv. 



1194170 LATU097 

Information Access Systems Announces Release of ITMS Web Search Server (TM) 



DATE: December 2, 1997 




19:31 EST 



WORD COUNT 




70 



...choose, a concept or subject in their 



own words . 



Search results are returned as a 
linked pages . The degree of match 
document segment) and the request is... 



relevance - ranked set of hyper - 
between each returned document (or 



17/3,K/27 (Item 2 from file: 813) 

DIALOG (R) File 813: PR Newswire 

(c) 1999 PR Newswire Association Inc. All rts. reserv. 



0990827 



DCW009 



PLS Speeds Time to Market for Content-Rich Web Sites With PLWeb Turbo 
Version 2 . 6 



...In response to a query, PLWeb Turbo presents the user with a list of 
hypertext links to "hit" documents listed in order of probable 
relevance to 

the user, helping users prioritize the search results. The relevance - 
ranking 

algorithm uses a number of criteria including number of hits, proximity of 
hits in a . . . 



17/3, K/28 (Item 1 from file: 610) 

DIALOG (R) File 610: Business Wire 

(c) 2004 Business Wire. All rts. reserv. 

00440874 20010112012B9283 (USE FORMAT 7 FOR FULLTEXT) 

Search Engines Take Quantum Leap: 19 out of 20 Now Use Link Popularity To 
Determine Relevancy; WebSeed Publishes Chart 

Business Wire 

Friday, January 12, 2001 12:50 EST 

JOURNAL CODE: BW LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT 
DOCUMENT TYPE: NEWSWIRE 
WORD COUNT: 394 

TEXT: 

...pages. Engines that rely on it, like Google and Northern Light, 
consistently receive the highest relevancy ratings from publications 
like PC 

Magazine (source: PC Magazine, "Search Engines At A Glance," November 15... 

...works: Rather than relying on meta tags or keywords, a 

search engine assumes that a page receiving hyperlinks from other 

pages is an 

important page and therefore deserves to be ranked higher. Today's link 
popularity. . . 

...and consider the link popularity of 

each originating page, meaning that when a highly-popular page points 
to 

another page , it makes the target page popular, too. 
?tl7/3,k/33,37,40,42 

17/3, K/33 (Item 1 from file: 20) 

DIALOG (R) File 20: Dialog Global Reporter 
(c) 2004 The Dialog Corp. All rts. reserv. 

18079689 {USE FORMAT 7 OR 9 FOR FULLTEXT) 

Use of AI for Employment Recruiting Likely to Become Mainstream with 
Delivery of HCMPro Recruiter From Thomas International 

BUSINESS WIRE 
July 30, 2001 



DATE: September 4, 1996 



09:04 EDT 



WORD COUNT: 



831 



JOURNAL CODE: WBWE LP^mkGE: English RECORD TYPE: ^BlTEXT 
WORD COUNT: 961 



(USE FORMAT 7 OR 9 FOR FULLTEXT) 

. . . links, then duplicates and reads each document word-for-word in 

order to generate a relevancy score based on the search request. The 
resumes of qualified candidates can then be viewed, saved. . . 

17/3, K/37 (Item 5 from file: 20) 

DIALOG (R) File 20: Dialog Global Reporter 
(c) 2004 The Dialog Corp. All rts. reserv. 

03181618 

Google eyes Yahoo 1 s search engine market 

SECTION TITLE: News 
John Geralds in Silicon Valley 
NEWSWIRE (VNU) 
October 20, 1998 

JOURNAL CODE: WNEW LANGUAGE: English RECORD TYPE: FULLTEXT 
WORD COUNT: 381 

... other 'important 1 Web sites. "You're the sum of the importance of 
the things that point to you, " said Page . He added that other search 
engines use linking relationships to build indexes, but do not... 

... Web. 'Clever' is being developed at IBM's Almaden Research Center. Like 
Google, it ranks pages by calculating links between them and measuring 
'importance', but it does not crawl the Web. Direct Hit also... 

17/3, K/40 (Item 8 from file: 20) 

DIALOG (R) File 20: Dialog Global Reporter 
(c) 2004 The Dialog Corp. All rts. reserv. 

01977 657 (USE FORMAT 7 OR 9 FOR FULLTEXT) 

Science and Technology: Hits and misses: A new type of Internet search 
engine may soon sort the electronic wheat from the chaff 

ECONOMIST 
June 20, 1998 

JOURNAL CODE: FECN LANGUAGE: English RECORD TYPE: FULLTEXT 
WORD COUNT: 1230 

(USE FORMAT 7 OR 9 FOR FULLTEXT) 

... dead ends 1 .) In these recalculations, each hypertext link to andA 

fro is multiplied by the relevant weight of the document it is 

pointing to or from to create a weighted link value. New overall weights 
for the page . . . 

J 

17/3, K/42 (Item 1 from file: 674) 

DIALOG (R) File 674: Computer News Fulltext 

(c) 2004 IDG Communications. All rts. reserv. 

071657 

Welcome to the Machine: The Quest for Stuff 

Byline: Mark Gibbs 

Journal: Network World Page Number: 34 

Publication Date: January 25, 1999 
Word Count: 5 68 Line Count: 53 

Text : 

... of the results that clinches it.BullsEye retrieves search results from 
the search engines, retrieves pages they cite , analyzes the page 

content, removes the dead links and presents a report . And when I say 
"presents," I mean it can display the report in a browser... 



...sites that have been ^^iewed. Third comes the Rapid Information Discover 
Engine, which determines the relevance ranking of documents, removes 
duplicates and dead links, and summarizes the content. And finally, there's 
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(c) 2004 JPO & JAPIO 
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10/9/1 (Item 1 from file: 350) 

DIALOG (R) File 350:Derwent WPIX 

(c) 2004 Thomson Derwent . All rts. reserv. 

015949912 **Image available** 
WPI Acc No: 2004-107753/200411 
XRPX Acc No: N04-085674 

Content relevancy rating changing program e.g. for help file, sorts 

contents retrieved during database search process, based on relevancy 

rating of content , for output 
Patent Assignee: FUJITSU LTD (FUIT ) 
Inventor: SHIBATA R 

Number of Countries: 002 Number of Patents: 002 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20040002945 Al 20040101 US 2002305178 A 20021127 200411 B 

JP 2004030526 A 20040129 JP 2002189753 A 20020628 200411 

Priority Applications (No Type Date) : JP 2002189753 A 20020628 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20040002945 Al 17 G06F-007/00 
JP 2004030526 A 17 G06F-017/30 

Abstract (Basic) : US 20040002945 Al 

NOVELTY - The program has a module which acquires information of 
active software during content database search process, for storage in 
a currently active software management table. A relevancy rating of 

content retrieved during search process is calculated and added with 
a prescribed value, for storage in a management table. The retrieved 
contents are sorted based on relevancy rating, for output. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: 

(1) recording medium for storing content relevancy rating 
changing program; and 

(2) content relevancy rating changing method. 

USE - For changing relevancy rating assigned to content such 
as help files, case e.g. files, and question and answer (Q and A) files 
stored on computer or content server. 

ADVANTAGE - Efficiently sorts/ranks the content retrieved from 
database based on relevancy rating of the content . 

DESCRIPTION OF DRAWING (S) - The figure shows the architecture of 
search results rank changing processing program. 
. pp; 17 DwgNo 1/12 
Title Terms: CONTENT; RATING; CHANGE; PROGRAM; HELP; FILE; SORT; CONTENT; 

RETRIEVAL; DATABASE; SEARCH; PROCESS; BASED; RATING; CONTENT; OUTPUT 
Derwent Class: T01 

International Patent Class (Main) : G06F-007/00; G06F-017/30 
International Patent Class (Additional) : G06F-015/00 
File Segment: EPI 

Manual Codes (EPI/S-X) : T01-J05B1; T01-J05B4M; T01-N03A2; T01-S03 



10/9/2 (Item 2 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv. 



015394679 **Image available** 
WPI Acc No: 2003-456820/200343 
XRPX Acc No: N03-363315 

Relevance rank determination method for web page, involves adjusting 

content -based relevance rank , based on link structure of pages 

including link rank values from in-coming links 
Patent Assignee: ALPHA S A (ALPH-I) 
Inventor: ALPHA S A 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030061214 Al 20030327 US 2001928962 A 20010813 200343 B 



Priority Applications (No Type Date) 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030061214 Al 11 G06F-007/00 



US 2001928962 A 20010813 



Abstract (Basic) : US 20030061214 Al 

NOVELTY - A content -based relevance rank is determined for 
each of the pages based on a content of each page. The relevance 
rank is adjusted based on a link structure of the pages including link 
rank values from in-coming links. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) system for determining a relevance rank determination system; 

and 

(2) candidate pages set ranking method. 

USE - For determining relevance rank of web page in Internet and 
intranet . 

ADVANTAGE - Improved rankings for web pages are obtained based on 
the given search query. Relevance rankings are based on linguistically 
aware link analysis where link values incorporate content -based 
relevance values of associated pages as a function of the page link 
structure . 

DESCRIPTION OF DRAWING (S) - The figure shows a flow chart 
explaining the relevance rank determination process. 

pp; 11 DwgNo 3/4 

Ti£J s eJ£rms; iM . RELEVANT ./RANK; DETERMINE; METHOD; $EBp 
C^^^^RKLEVJ^^'^^KUBASED; LINK; STRUCTURE; PAGE;* 
LINK 

Derwent Class: T01 

International Patent Class (Main) : G06F-007/00 
File Segment: EPI 

Manual Codes (EPI/S-X) : T01-E01A; T01-N03A2 




CONTENTS; 
ffirtJE'T"' 



10/9/4 (Item 4 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv. 

013491837 **Image available** 

WPI Acc No: 2000-663780/200064 

XRPX Acc No: N00-491806 

Computerized document ranking method for world wide web, involves pruning 
graph representing a node, if weight associated with particular node of 
another subset is less than threshold 

Patent Assignee: ALTAVISTA CO (ALTA-N) 

Inventor: BHARAT K A; HENZINGER M R 

Number of Countries: 001 Number of Patents: 001 



Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6112203 A 20000829 US 9858577 A 19980409 200064 B 

Priority Applications (No Type Date) : US 9858577 A 19980409 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 6112203 A 9 G06F-017/21 

Abstract (Basic) : US 6112203 A 

NOVELTY - Some documents are selected to form topic and relevance 
weight is assigned to each node which represents document in graph 
based on document similarity. A subset of documents is re-selected, to 
prune a specific node. If associated weight of that node is less than 
threshold, pruned graph representing another subset of document is 
formed in which documents are ranked using connectivity based ranking 
scheme . 

DETAILED DESCRIPTION - A set of documents include pages provided by 
a search engine in response to a query. The set of documents includes 
pages directly linked to the produced pages. 

USE - For ranking set of computerized documents based on content 
and connectivity of documents in world wide web (WWW) . 

ADVANTAGE - During content analysis phase, a relevant weight 
is assigned to chosen subset of the nodes in the graph carefully. The 
relevance weight of document is increased when the document includes 
words that are terms of the query. The graph is pruned to eliminate 
nodes whose relevance weights are less than preset threshold. During 
connectivity analysis phase, remaining nodes of the pruned graph are 
then scored according to their connectivity to determine normalized hub 
and authority scores for the documents, thereby ranking the documents. 

DESCRIPTION OF DRAWING (S) - The figure shows the flow diagram of 
process for distilling query topic, 
pp; 9 DwgNo 2/3^-* 
Title Terms: DOCUMENT ;< ^RA% / METHOD; WORLD; WIDE; WEB; PRUN£U-^RAPHc 
REPRESENT; NODE; WEIGHT; ASSOCIATE; NODE; SUBSET; LESS; ^HRESHOLD^ 

Derwent Class: T01 ^ ™ — — 

International Patent Class (Main) : G06F-017/21 
File Segment: EPI 

Manual Codes (EPI/S-X) : T01-E01B; T01-H07C5E; T01-J05B1; T01-J11C 

9 



0 



14/9/1 (Item 1 from file: 350) 

DIALOG (R) File 350:Derwent WPIX 

(c) 2004 Thomson Derwent . All rts. reserv. 

015683770 **Image available** 

WPI Acc No: 2003-745959/200370 

XRPX Acc No: N03-597642 

Subject-specif ic information compiling and accessing method in computer 
network, involves filtering contents of each site twice by using 
predetermined technique to determine relevancy of contents 

Patent Assignee: JACOBSEN B (JACO-I); SOREN A (SORE-I) ; MORTENSEN S A 
(MORT-I) 

Inventor: JACOBSEN B; SOREN A; MORTENSEN S A 
Number of Countries: 032 Number of Patents: 002 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030163454 Al 20030828 US 200282354 A 20020226 200370 B 

EP 1341099 A2 20030903 EP 2003250903 A 20030213 200370 

Priority Applications (No Type Date) : US 200282354 A 20020226 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030163454 Al 24 G06F-007/00 
EP 1341099 A2 E G06F-017/30 

Designated States (Regional) : AL AT BE BG CH CY CZ DE DK EE ES FI FR GB 

GR HU IE IT LI LT LU LV MC MK NL PT RO SE SI SK TR 

Abstract (Basic) : US 20030163454 Al 

NOVELTY - Links are traversed between sites on the computer 
network. The content of each accessed site are filtered twice either by 
presenting the contents to a human editor or by passing the contents 
through a lexicon based filter, to determine relevancy of the contents. 
The contents of sites determined to be relevant are indexed. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) computer readable medium storing subject specific information 
compiling and accessing program; 

(2) subject specific information compiling and accessing system; 

and 

(3) web page information relevance information ranking method 
of database. 

USE - For compiling and accessing subject-specific information e.g. 
law and medicine from computer network such as internet, intranet, 
local area network and wide area network. 

ADVANTAGE - Filters and discards irrelevant web pages and ignores 
links to discarded web pages . 

DESCRIPTION OF DRAWING (S) - The figure shows an explanatory view of 
the specific subject information compiling and accessing method. 

pp; 24 DwgNo 3B/8 

Title Terms: SUBJECT; SPECIFIC; INFORMATION; COMPILE; ACCESS; METHOD; 

COMPUTER; NETWORK; FILTER; CONTENT; SITE; TWICE; PREDETERMINED; TECHNIQUE 

; DETERMINE; CONTENT 
Derwent Class: T01; W01 

International Patent Class (Main) : G06F-007/00; G06F-017/30 
File Segment: EPI 

Manual Codes (EPI/S-X) : T01-E; T01-S03; W01-A06B5A 



14/9/2 (Item 2 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 



0 



(c) 2004 Thomson Derwent . All rts. reserv. 

015573228 **Image available** 

WPI Acc No: 2003-635385/200360 

XRPX Acc No: N03-505363 

Inferred relation weighting method for ranking search results of Internet 
search engine, involves determining strength of links between Internet 
objects and common object to determine relevancy score of inferred 
relation 

Patent Assignee: BALAS UBRAMAN I AM S ( BALA- 1 ) ; DOROSARIO A (DORO-I); GUIDI J 

(GUID-I); KOLLURI V (KOLL-I); KOTLOV A (KOTL-I) 
Inventor: BALAS UBRAMAN I AM S; DOROSARIO A; GUIDI J; KOLLURI V; KOTLOV A 
Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030101286 Al 20030529 US 2001902421 A 20010710 200360 B 

Priority Applications (No Type Date) : US 2001902421 A 20010710 
Patent Details: 
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Abstract (Basic) : US 20030101286 Al 

NOVELTY - The strength of respective links (38,42) between the 
Internet objects (02,01) and a common object (D2) are determined. The 
relevancy score of inferred relation between the Internet objects is 
determined, based on the link strengths. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
inferred relation weighing program. 

USE - For determining relevancy score of inferred relation 
between Internet objects such as query and Internet document or query 
and transaction record for ranking search results of Internet search 
engine . 

ADVANTAGE - Enhances the relevance of list of documents generated 
in response to query and predicts or suggests addition search terms 
required by the user to include in the query. 

DESCRIPTION OF DRAWING (S) - The figure shows the inferred relation 
weighting process. 

links (38,42) 

Internet objects (Q2,D1) 

common object (D2) 
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Database arranging system determines link to specific website based 



on corresponding relevancy weight assigned to term in received 
question 
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Priority Applications (No Type Date): US 2001792460 A 20010223 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20020152192 Al 12 G06F-007/00 

Abstract (Basic) : US 20020152192 Al 

NOVELTY - A host system (10) generates group of keywords based on 
specific ranking. The host system receives a search question includes a 
term and determines whether the term corresponds to one of the keyword 
groups. A corresponding relevancy weight is assigned to the term, 
based on the determined result. A link to a website is determined 
based on the assigned weight and the determined link is presented. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for the 
following: 

(1) Method for database arranging corresponding to website accessed 
by user; and 

(2) Recording medium storing computer program for arranging 
database . 

USE - For arranging database corresponding to website selling 
electronic equipment such as camera, computer, CD players and other 
websites . 

ADVANTAGE - Enables user to search relevant information quickly and 
accurately. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the database arranging system. 
Host system (10) 
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Hypertext database traversing method for retrieving an electronic 
document in the hypertext database of a computer system 
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US 6067552 A 23 G06F-017/30 CIP of application US 95517136 

Abstract (Basic) : US 6067552 A 

NOVELTY - A list of electronic documents (108a-108c) is produced 
based on the result of comparison between a set of relevant index term 
values and sets of descriptive index terms (212a-212h) . The list is 
ranked with reference to the relevancy of each document with respect to 
a user, based on the weighted relevancy ranking . Each document in 
the list contains at least one relevant index term value. 

DETAILED DESCRIPTION - A weighted relevancy ranking of each 
descriptive index term (212a-212h) and a set of relevant index term 
values are received. The set of relevant index term values are compared 
with the sets of descriptive index terms using the weighted relevancy 

ranking . An INDEPENDENT CLAIM is also included for a 
computer-readable medium. 

USE - For retrieving an electronic document in the hypertext 
database of a computer system. 

ADVANTAGE - Reduces set of relevant links, and enables 
incorporation of expert knowledge of past relevance to determine 
present relevance of documents. Reduces elapsed user time for 
traversing the database. Allows user to control trade-off between 
complexity and number of intermediate links to the relevant 
documents . 

DESCRIPTION OF DRAWING (S) - The figure shows the diagram of a 
hypertext document database in which the documents are tagged with 
index terms . 

Documents (208a-208c) 

Descriptive index terms (212a-212h) 
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Abstract (Basic) : US 6055542 A 

NOVELTY - The relevance rating for each data section contained 
in a web page, are computed to display the data section in a 
predetermined order. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: 

(a) a data handling system; 

(b) and a computer readable medium. 

USE - For client computer connected to computer network. 

ADVANTAGE - Sorts data in web page according interest of user. Does 
not require reading of entire web page. Enables user to quickly spot 
area of interests within document . Moves desired point of interest 
within web page. 

DESCRIPTION OF DRAWING (S) - The figure shows the flowchart of a web 
page rearrangement . 
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Abstract (Basic) : US 5535382 A 

The document retrieval system retrieves registered documents from a 
document database responsive to retrieval conditions designated by a 
user. The document retrieval system includes a query converter for 
converting the retrieval condition designated by the user into a query 
which has a predetermined normal form in which keywords and at least 
one type of logical operation out of logical operations AND, OR and NOT 
are connected, a bibliographical information indicator for indicating a 
relation between each of the registered documents and keywords and a 
keyword connection table having relationship values, each of the 
relationship values representing the degree of relationship between 
each two keywords . 

The document retrieval system also includes a selector for 



# 



referring the inverted file and the keyword connection to select 
registered documents which satisfy the query, and an outputting 
circuit for outputting registered documents selected by the selector. 

ADVANTAGE - Documents are given in order starting from document 
closest to user's requirements to enable flexible and quick retrieval. 
Dwg. 7/8 

Title Terms: DOCUMENT; RETRIEVAL; SYSTEM; RANK; DOCUMENT; RELEVANT; VALUE; 

INDICATE; DOCUMENT; QUERY; RETRIEVAL; CONDITION; CONJUNCTION; NORMAL ; 

QUERY; SUB; QUERY; SUB; RELEVANT; VALUE 
Derwent Class: T01 

International Patent Class (Main) : G06F-017/30 
International Patent Class (Additional) : G06F-015/18 
File Segment: EPI 

Manual Codes (EPI/S-X) : T01-J05B3 



File 34 8 : EUROPEAN PATENTS 1978-2004 /Mar W02 

(c) 2004 European Patent Office 
File 349: PCT FULLTEXT 1979-2002/UB=20040318 , UT=20040311 





(c) 


2004 WIPO/Univentio 


Set 


Items 


Description 


SI 


943761 


CONNECTION? OR CONNECTING? OR CONNECTED OR CONNECT? ? OR I- 




NTERCONNECT ? 


S2 


151176 


CITE OR CITES OR CITED OR CITING 


S3 


816253 


POINT OR POINTS OR POINTED OR POINTER? ? OR POINTING 


S4 


401587 


LINK? OR HYPERLINK? OR HOTLINK? OR LIVELINK? OR INTERLINK? 


S5 


1554 


RELEVAN? ( 1W) (RANK? OR SCORE OR SCORES OR VALUATION? OR RAT- 






ING? OR WEIGHT? ? OR VALUE OR VALUES) 


S6 


87867 


S1:S4(3N) (PAGE OR PAGES OR WEB PAGE? OR WEBSITE? ? OR DOCUM- 



ENT? ? OR ARTICLE? ? OR RECORD? ? OR REPORT? ? OR FILE OR FIL- 
ES) 



S7 


22013 


S1:S4 (3N) OBJECT? ? 


S8 


29 


S5 (10N) CONTENT 


S9 


29 


ID PAT (sorted in duplicate/non-duplicate order) 


S10 


27 


ID PAT (primary/non-duplicate records only) 


Sll 


15 


S5(25N)S6:S7 


S12 


12 


Sll NOT S8 


S13 


12 


IDPAT (sorted in duplicate/non-duplicate order) 


S14 


12 


IDPAT (primary/non-duplicate records only) 



? tl0/6/12;tl0/5,k/2,4, 9-10, 14-16 



10/6/12 (Item 12 from file: 349) 

01066449 **Image available** 

CAPSULES FOR DRY POWDER INHALERS AND METHODS OF MAKING AND USING SAME 
CAPSULES POUR INHALATEURS A POUDRE SECHE ET PROCEDES DE FABRICATION ET 
D 1 UTILISATION CORRE SPONDANTS 

Publication Language: English 
Filing Language: English 
Fulltext Availability: 

Detailed Description 

Claims 

Fulltext Word Count: 12054 
Publication Year: 2003 

10/5 ,K/2 (Item 2 from file: 348) 

DIALOG (R) File 348:EUROPEAN PATENTS 

(c) 2004 European Patent Office. All rts. reserv. 

01335027 

System and method for predicting web user flow by determining association 

strength of hypermedia links 
System und Verfahren zur Vorhersage von Benutzerstromen im World Wide Web 

durch Bestimmung der Verbindungsstarke von Querverweisen zwischen 

Hypermedia-Dokumenten 
Systeme et procede pour predire le flux d'utilisateurs web en determinant 

la force de connection. des liens entre des documents hypermedia 
PATENT ASSIGNEE: 

Xerox Corporation, Patent Department, (3184840), Xerox Square - 20A, 100 
Clinton Avenue South, Rochester, New York 14644, (US), (Applicant 
designated States: all) 
INVENTOR: 

Pirolli, Peter L., 2958 Sloat Boulevard, San Francisco, California 94116, 
(US) 

Chi, Ed H., 488 University Avenue 412, Palo Alto, California 94301, (US) 
Pitkow, James E., 742 Ellsworth Place, Palo Alto, California 94306, (US) 

LEGAL REPRESENTATIVE: 

Grunecker, Kinkeldey, Stockmair & Schwanhausser Anwaltssozietat (100721) 
, Maximilianstrasse 58, 80538 Munchen, (DE) 

PATENT (CC, No, Kind, Date) : EP 1139236 Al 011004 (Basic) 

APPLICATION (CC, No, Date): EP 2001108054 010329; 

PRIORITY (CC, No, Date): US 540976 000331 

DESIGNATED STATES: DE; FR; GB 

EXTENDED DESIGNATED STATES: AL; LT; LV; MK; RO; SI 
INTERNATIONAL PATENT CLASS: G06F-017/30 

ABSTRACT EP 1139236 Al 

The present invention provides a system and method for predicting user 
traffic flow in a collection of hypermedia documents by determining the 
association strength of the hypermedia links. Hypermedia links are 
identified among a plurality of documents, where the documents include 
content items such as keywords that may or may not be relevant to a user 
information need. The distribution of the content items in the document 
collection is then determined. An information item or query is received 
as input, and is compared to the content items. In response to the 
comparison, association strengths are assigned to the hypermedia links. A 
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..SPECIFICATION one parent and the children of a given object are 
explicitly identified. An object's relevance score is then calculated 
as a function of its content -based relevance score and the 
relevance scores of its children. Relevance scores must be 
propagated from the leaves of a hierarchy to the root (see, Frisse, M. . . 

. .hit-list. 

In the third technique, the content of neighbouring objects is added to 
the content of the current object when determining the relevance 
score for the current object (see Croft et al. "Retrieving Documents by 
Plausible Inference: an Experimental... 
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English Abstract 

A method of aggregating and disseminating time sensitive information. A 
data source is searched to identify recently-posted information items 
matching predetermined selection criteria. A respective relevance score 
is calculated for each identified information item. A respective urgency 
rating is determined for each identified information item. Each 
information item is triaged using the urgency rating, and disseminated to 
at least one client based on the triage result. 

French Abstract 

L 1 invention concerne un procede de recueil et de dissemination 
d 1 informations temporaires. On consulte une source de donnees afin 
d 1 identifier les articles d' information postes recemment et qui 
correspondent a des criteres de selection predetermines. On calcule un 
resultat pertinent respectif pour chaque article d 1 information identifie. 
On determine par la suite une evaluation d'urgence respective pour chaque 
article d 1 information identifie. On trie chaque article d 1 information a 
l f aide de l 1 evaluation d'urgence et on le repand a au moins un client 
d ! apres le resultat du triage. 
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Claim 

... of each 

terms of the taxonomy, and; 
- 22 

for each taxonomy term located in the content , adding the respective 
relevancy weighting to the composite relevancy score . 

11 A method as claimed in claim 9, wherein the taxonomy comprises a 

plurality 

of. . . 

...occurrences of each term of 
the taxonomy, and; 

for each taxonomy term located in the content , adding the respective 
relevance weighting to a composite relevancy score pertaining to each 
category. 

29 A method as claimed in claim 28, further comprising a. . . 
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English Abstract 

A search request is received at an information retrieval system from a 
searcher. The search request preferably contains at least one search term 
and a user identifier. A plurality of objects are then searched based on 
the at least one search term. At least one located object is found from 
the plurality of objects. The at least one located object is associated 
with the search term(s) . An intrinsic score based on the search term(s) 
is subsequently calculated for each located object. The intrinsic score 
is then adjusted to an adjusted score based on the difference between a 
creator expertise of a creator of the at least one located object and/or 
a contributor expertise of a contributor to the at least one located 
object, and a searcher expertise of the searcher. 

French Abstract 

Une demande de recherche emanant d'un chercheur est recue dans un systeme 
de recherche d' information. La demande de recherche renferme de 
preference au moins un critere de recherche et un identif icateur 
d' utilisateur . Une pluralite d'objets sont alors cherches en se basant 
sur au moins un critere de recherche. Au moins un objet defini est trouve 
a partir de la pluralite d'objets. Au moins I'objet defini est associe 
au(x) critere (s) de recherche. Un classement intrinseque sur la base 
du/ou des criteres de recherche est ensuite calcule pour chaque objet 
defini. Le classement intrinseque est alors ajuste par rapport a un 
classement ajuste, base sur la difference entre une expertise de creation 
emanant d'un createur d'au moins ledit objet defini et/ou une expertise 
de contribution d'un participant au moins audit objet defini, et une 
expertise de recherche emanant du chercheur. 

Legal Status (Type, Date, Text) 

Publication 20031224 A2 Without international search report and to be 

republished upon receipt of that report. 



Fulltext Availability: 
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Detailed Description 

. . . it adjusts the score by slightly raising the adjusted score when only 
a few relevant content objects are found in the database. 

Therefore, the absolute relevance score for a given user (or the user 
expertise) is. 

(I 2) RS-EXP-ABS = 3. . . 
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English Abstract 

A method and a system for the creation of agents to be used for 
recommending media content to a user (207) . The agent comprises a profile 
of media content and can further comprise an algorithm for recommending 
media (201) on a media system (208). The media system (208) can be a VCR, 
a TV, a set-top box, a DVD player, a radio or a personal computer. The 
method includes the steps of validating an agent in an off-line system 
(210), transferring agents forth and back between the off-line system 
(210) and an on-line system (203) . The step of validating includes the 
steps of composing a test set of historical content, estimating scores of 



an agent in the test set, determine average difference between agent 
scores and historical relevance feedback, determining performance of an 
agent using average difference of all agents over the test set and 
reliability of historical relevance feedback. The method further includes 
the steps of nominating agents in the on-line system (203), selecting 
agents and copying agents forth and back between the on-line system (203) 
and the off-line system (210); importing, generating, training, ranking, 
nominating and deleting agents in the off-line system (210) . 

French Abstract 

L' invention concerne un procede et un systeme de creation d' agents 
destines a etre utilises pour recommander un contenu multimedia a un 
utilisateur (207) . Cet agent comprend un profil de contenu multimedia et 
eventuellement un algorithme permettant de recommander des elements 
multimedia (201) dans un systeme multimedia (208) . Ce systeme multimedia 
(208) peut se presenter sous la forme d'un magnetos cope a cassettes 
(VCR), d f une television, d'un boitier decodeur, d ! un lecteur de DVD, d'un 
poste de radio ou d'un ordinateur personnel. Le procede comprend les 
etapes consistant a : valider un agent dans un systeme hors ligne (210) ; 
et transferer des agents bidirectionnellement entre ce systeme hors ligne 
(210) et un systeme en ligne (203). L'etape de validation comprend les 
etapes consistant a : composer un ensemble de controle de contenu 
historique ; evaluer les resultats d ! un agent pour cet ensemble de 
controle ; determiner la difference moyenne entre les resultats de cet 
agent et la retroaction de pertinence historique ; determiner les 
performances d'un agent au moyen de la difference moyenne de tous les 
agents pour 1 ! ensemble de controle et la fiabilite de la retroaction de 
pertinence historique. Ce procede comprend en outre les etapes consistant 
a : designer des agents dans le systeme en ligne (203) ; selectionner des 
agents et copier des agents bidirectionnellement entre le systeme en 
ligne (203) et le systeme hors ligne (210) ; importer, generer, 
entrainer, classer, designer et supprimer des agents dans le systeme hors 
ligne (210) . 
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. . . is defined as the absolute 

difference between the estimated score of an agent for media content x 
and the relevance feedback score - relevance in the formula - of 
media content item x. 



dif f erencej . , =1 score j , , -relevance,, 

The estimated score of an agent j - score in the... 

...procedure for the agent population may be as follows. 



- Compose a test set of media content items (i.e. information about 
media content ) with their corresponding relevance feedback scores , 
from the data in the relevance feedback database, i.e. from the database 
of reference. . .the information about media content and let each agent 
generate a score for this 

media content , 

- Take the relevance feedback score from the user and calculate the 



difference for each 
agent, 

- Repeat the last two steps... 



10/5, K/15 (Item 15 from file: 349) 

DIALOG (R) File 349: PCT FULLTEXT 

(c) 2004 WIPO/Univentio. All rts. reserv. 

00970434 **Image available** 

METHOD AND APPARATUS TO DISTRIBUTE CONTENT USING A MULTI-STAGE BROADCAST 
SYSTEM 

PROCEDE ET APPAREIL DE DISTRIBUTION DE CONTENU AU MOYEN D 1 UN SYSTEME DE 
DIFFUSION MULT I -E TAGE S 

Patent Applicant/Assignee: 

INTEL CORPORATION, 2200 Mission College Boulevard, Santa Clara, CA 95052, 
US, US (Residence), US (Nationality) 
Inventor (s) : 

CONNELLY Jay, 3148 NW 126th Place, Portland, OR 97229, US, 
Legal Representative: 

MALL IE Michael J (agent), Blakey, Sokoloff, Taylor & Zafman, 12400 
Wilshire Boulevard, 7th Floor, Los Angeles, CA 90025, US, 
Patent and Priority Information (Country, Number, Date) : 

Patent: WO 2002103941 A2-A3 20021227 (WO 02103941) 

Application: WO 2002US17270 20020531 (PCT/WO US0217270) 

Priority Application: US 2001882205 20010615 
Designated States: AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU 

CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP 

KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO 

RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG UZ VN YU ZA ZM ZW 

(EP) AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR 

(OA) BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG 

(AP) GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW 

(EA) AM AZ BY KG KZ MD RU TJ TM 
Main International Patent Class: H04H-009/00 
International Patent Class: H04N-007/173 
Publication Language: English 
Filing Language: English 
Fulltext Availability: 

Detailed Description 

Claims 

Fulltext Word Count: 1332 6 
English Abstract 

A broadcast system, method and apparatus providing content on demand. In 
one embodiment, the disclosed broadcast system includes a server that 
broadcast content descriptors to a plurality of clients. The content 
descriptors describe available content that can be broadcast or 
potentially be broadcast later by the server. Each client receives the 
broadcasted content descriptors from the server and updates and maintains 
a local content descriptor table and a demand data table. Based on the 
content descriptors, previous access habits of the user and optional user 
classifications, the client system update demand data indicating the 
desirability of the pieces of available content. The demand data is fed 
back to the server and the server then sorts the available content based 
on the desirability indicated by demand data. The server then broadcasts 
further descriptive content to the clients. The clients receive the 
further descriptive content enabling further demand data feedback from 
the clients to the server regarding the desirability of the pieces of 
content. The feedback process can be repeated to narrow the list of 
available client such that the more demand content is ultimately 



broadcast by the server to the clients. In various aspects of the present 
invention, the content descriptors can be sent from the server to the 
clients in a numerous manners and the demand data can be sent from the 
clients to the server in numerous manners. 

French Abstract 

L 1 invention concerne un systeme, un procede et un appareil de diffusion 
permettant de fournir un contenu sur demande. Dans un mode de 
realisation, le systeme de diffusion selon 1' invention comprend un 
serveur qui diffuse des descripteurs de contenu a une pluralite de 
clients. Ces descripteurs de contenu decrivent un contenu disponible 
pouvant etre diffuse ou pouvant eventuellement etre diffuse 
ulterieurement par le serveur. Chaque client recoit du serveur les 
descripteurs de contenu diffuse, met a jour et conserve une table de 
descripteurs de contenu locale et une table de donnees de demandes . En 
fonction des descripteurs de contenu, d 1 habitudes d'acces anterieures de 
1 1 utilisateur et de classifications utilisateur f acultatives, le systeme 
client met a jour des donnees de demandes indiquant la desirabilite des 
pieces de contenu disponible. Les donnees de demandes sont reacheminees 
vers le serveur qui trie ensuite le contenu disponible en fonction de la 
desirabilite indiquee par les donnees de demandes. Le serveur diffuse 
ensuite un autre contenu descriptif aux clients. Les clients recoivent 
cet autre contenu descriptif qui permet une autre retroaction de donnees 
de demandes provenant des clients vers le serveur, qui concerne la 
desirabilite des pieces de contenu. Le processus de retroaction peut etre 
repete pour reduire la liste de clients disponibles, de sorte que le 
contenu le plus demande est finalement diffuse aux clients par le 
serveur. Dans divers aspects de l 1 invention, les descripteurs de contenu 
peuvent etre envoyes du serveur aux clients de plusieurs manieres et les 
donnees de demandes peuvent etre envoyees des clients au serveur de 
plusieurs manieres. 
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Detailed Description 

one embodiment, content descriptor table 801 includes a list of 
attributes, attribute values and corresponding relevance values and 
believability factors. In particular, content descriptor table 801 
includes attribute values "Joe Smith, " "Jane Doe, " "action, " and 
"comedy. " At this. . . 

. . .maintained as the user interacts with the client system. 

1 5 In one embodiment, the relevance values in content descriptor 
table . . .user to watch a particular movie because of this particular 
attribute value. In one embodiment, relevance values in content 
descriptor table 801 are within a range of values such as for example 
from - l...that "Action Dude" features actor "Joe Smith" and is an 
"action" movie. Thus, referring to content descriptor table 801 in 
Figure I 1, the relevance values for attribute values "Joe Smith" and 
"action" are ad.usted to reflect that the user... Funny Show" features 
actress "Jane Doe" and is a "comedy" movie. Thus, referring back to 



content descriptor table 90 I in Figure I 1, the relevance values 
for attribute values "Jane Doe" and "comedy" are adjusted to reflect that 
the user explicitly ... data table 90 1, as described in process block 61 
1, are related to the relevance values and believability factors of 
the attribute values listed in content descriptor table 801. A detailed 
description of the processing that occurs in process block 61... the 
relevance values for these attribute values were greater than zero. 

In one embodiment, the relevance values associated with implicitly 
rated data files are also increased in content descriptor table 801 in 
response to a user access. However, in the example shown in 12, "Action 
Dude" was explicitly classified by the user. In one embodiment, the 
relevance values are not updated in content descriptor table 80 I in 
response to a user access of data files explicitly classified. .. and 
"action." Content descriptor table 801 of Figure 12 shows that "Joe 
Smith" has a relevance value of 1 and a believability factor of 1. 
Content descriptor table 8 0 1 of Figure 12 also shows that "action" has 
a relevance ... data table 90 1, "Blast f Em" has the attribute values "Jane 
Doe" and "action." The relevance value and believability factors for 
"Jane Doe" in content descriptor table 801 of Figure 12 are -1 and 0, 
respectively. Thus, the rating of . . . 
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English Abstract 

Methods and apparatuses providing feedback to a server from a client in a 
content distribution broadcast system. In one aspect, feedback is sent 
from a client to a server in response to a trigger. In another aspect, 
feedback is sent after a predetermined amount of time has lapsed. In yet 
another aspect, feedback is sent after a rankings or ratings have been 
generated for a predetermined number of pieces of content. In still 
another aspect, feedback is sent after a predetermined amount of content 
has been consumed. In yet another aspect, feedback is sent when the 
amount of unconsumed content is less than a predetermined threshold 
amount . 



French Abstract 

L' invention concerne des procedes et des dispositifs permettant a un 
client d'envoyer une retroactions dans un systeme de diffusion de 
contenu. Dans un aspect de 1' invention, la retroaction est envoyee par le 
client a un serveur en reponse a un declenchement . Dans un autre aspect 
de 1' invention, la retroaction est envoyee apres que des classements ou 
des caracterisations ont ete generes pour un nombre predetermine de 
bribes de contenu. Dans un autre aspect de 1' invention, la retroaction 
est envoyee apres qu'une quantite predeterminee de contenu a ete 
consommee. Dans un autre mode de realisation, la retroaction est envoyee 
lorsque la quantite de contenu non consommee est inferieure a une 
quantite de seuil predeterminee. 
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Detailed Description 

one embodiment, content descriptor table 801 includes a list of 
attributes, attribute values and corresponding relevance values and 
believability factors. In particular, content descriptor table 801 

includes attribute values "Joe Smith Jane Doe," "action," and 4 

4 comedy . " At . . . 

. . .maintained as the user interacts with the client system. 

1 5 In one embodiment, the relevance values in content descriptor 
table 801 are indicators as to how relevant the associated attribute and 
attribute values ...user to watch a particular movie because of this 
particular attribute value. In one embodiment, relevance values in 
content descriptor table 801 are within a range of values such as for 
example from -10... that "Action Dude" features actor "Joe Smith" and is 
an "action" movie. Thus, referring to content descriptor table 801 in 
Figure I 1, the relevance values for attribute values "Joe Smith" and 
"action" are adjusted to reflect that the user explicitly ... Funny Show" 
features actress "Jane Doe" and is a "comedy" movie. Thus, referring back 
to content descriptor table 801 in Figure I 1, the relevance values 
for attribute values "Jane Doe" and "comedy" are adjusted to reflect that 



the user explicitly ... demand data table 901, as described in process 
block 61 1, are related to the relevance values and believability 
factors of the attribute values listed in content descriptor table 801. 
A detailed description of the processing that occurs in process block 61 
...the relevance values for these attribute values were greater than 
zero . 

In one embodiment, the relevance values associated with implicitly 
rated data files are also increased in content descriptor table 801 in 
response to a user access. However, in the example shown in... 

...of Figure 12, "Action Dude" was explicitly classified by the user. In 
one embodiment, the relevance values are not updated in content 
descriptor table 801 in response to a ...and "action." Content 
descriptor table 801 of Figure 12 shows that "Joe Smith" has a relevance 
value of I and a believability factor of 1. Content descriptor table 
801 of F igure 12 also shows that "action" has a ...demand data table 
901, "Blast' Ern" has the attribute values "Jane Doe" and "action." The 
relevance value and believability factors for "Jane Doe" in content 
descriptor table 801 of Figure 12 are -I 
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English Abstract 

A method and system for periodically deriving an optimal batch broadcast 
schedule based on client demand feedback data from a distributed set of 
broadcast clients. The broadcast system includes an operation center that 
broadcasts meta-data to a plurality of client systems. The meta-data 
describes a plurality of pieces of content that are in consideration for 
upcoming broadcasts by the server. Each client receives the broadcasted 
meta-data from and sends back a set of client demand feedback data to the 
operations center, wherein the user feedback data reflects a client's 
interest level in at least a portion of the pieces of content. The 
feedback data, which typically may include ratings and/or relative 
rankings, may be user-generated, automatically-generated, or a 
combination of the two. The system then send a batch of content based on 
an aggregation of the feedback data in combination with available 
broadcast bandwidth and broadcast schedule window. 

French Abstract 

L 1 invention concerne un procede et un systeme de derivation periodique 
d'un programme de diffusion de lot optimal base sur des donnees de retour 
de demande client provenant d'un ensemble reparti de clients recevant la 
diffusion. Le systeme de diffusion est dote d'un centre operationnel 
diffusant des meta-donnees a plusieurs systemes clients. Les meta-donnees 
decrivent plusieurs pieces de contenu qui sont considerees pour les 
diffusions ulterieures par le serveur. Chaque client recoit les 
meta-donnees diffusees et renvoie un ensemble de donnees de retour de 
demande client au centre operationnel. Les donnees de retour utilisateur 
refletent le niveau d'interet du client dans au moins une partie des 
pieces de contenu. Les donnees de retour, qui peuvent generalement 
inclure des evaluations et/ou des classements relatifs, peuvent etre 
generees par 1 1 utilisateur, generees automatiquement ou encore se 
presenter comme une combinaison des deux. Ce systeme envoie ensuite un 
lot de contenu base sur un regroupement de donnees de retour combinees a 
une largeur de bande de diffusion disponible et une fenetre de 
programmation de diffusion. 
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Detailed Description 

. . . of the data files, the answer to a decision block 407 is YES, and the 
relevance values of the particular attributes of the classified pieces 
of content are updated in meta-data table 601 in a block 409. In a 
block 41... the relevance values for these attribute values were greater 
than zero. 

In one embodiment, the relevance values associated with implicitly 
rated pieces of content are also increased in meta-data table 601B in 
response to a user access. However ... for the attribute values of the data 
file. 

To illustrate, referring to "Action Dude 11 in content rating table 701A, 
the product of the relevance value and believability factor of "Joe 
Smith" is I * 1, which 
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English Abstract 

A broadcast method and system for continuously and opportunistically 
driving an optimal broadcast schedule based on most recent client demand 
feedback from a distributed set of broadcast clients. The broacast system 
includes an operation center that broacasts meta-data to a plurality of 
client systems. The meta-data describes a plurality of pieces of content 
that are in consideration for upcoming broacasts by the server. Each 
client receives the broacasted meta-data from and sends back a set of 
client demand feedback data to the operations center, wherein the user 
feedback data reflects a client's interest level in at least a portion of 
the pieces of content. The feedback data, which typically may include 
ratings and/or relative rankings, may be user-generated, 
automatically-generated, or a combination of the two. The system then 
determines a most opportunistic piece of content to be broadcast based on 
an aggregation of the client demand feedback data. 

French Abstract 

Procede et systeme de diffusion permettant d'elaborer en continu et de 
maniere opportune un programme optimal de diffusion sur la base des 
exigences les plus recentes exprimees par un groupe reparti de clients. 
Ledit systeme de diffusion comporte un centre des operations qui diffuse 
des meta-donnees a une pluralite de systemes clients. Les meta-donnees 



decrivent une pluralite de pieces de contenu qui sont envisagees pour des 
diffusions futures par le serveur. Chaque client recoit les meta-donnees 
en provenance du centre des operations et renvoie a ce centre une serie 
de donnees de reaction sur ses exigences, les donnees de reaction 
produites par les utilisateurs refletant le niveau d'interet d f un client 
concernant au moins une partie des pieces de contenu. Les donnees de 
reaction, qui peuvent comporter typiquement des evaluations et / ou des 
classements relatifs, peuvent etre produites par 1 1 utilisateur, produites 
automat iquement ou une combinaison des deux. Le systeme determine ensuite 
la piece de contenu la plus opportune a diffuser, sur la base du 
regroupement des donnees de reaction sur les exigences des clients. 
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. . . of the data files, the answer to a decision block 407 is YES, and the 
relevance values of the particular attributes of the classified pieces 
of content are updated in meta-data ...the relevance values for these 
attribute values were greater than zero. 

In one embodiment, the relevance values associated with implicitly 
rated pieces of content are also increased in meta-data table 601B in 
response to a user access. However ... attribute values of the data file. 

1 5 To illustrate, referring to "Action Dude" in content rating table 
701 A, the product of the relevance value and believability factor of 
"Joe Smith" is I * 1, which equals 1. 

The product of 
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English Abstract 

A relevancy ranking and clustering method and system that determines the 
relevance of a document relative to a user's query using a similarity 
comparison process. Input queries are parsed into one or more query 
predicate structures using an ontological parser. The ontological parser 
parses a set of known documents to generate one or more document 
predicate structures. A comparison of each query predicate structure with 
each document predicate structure is performed to determine a matching 
degree, represented by a real number. A multilevel modifier strategy is 
implemented to assign different relevance values to the different parts 
of each predicate structure match to calculate the predicate structure's 
matching degree. The relevance of a document to a user's query is 
determined by calculating a similarity coefficient, based on the 
structures of each pair of query predicates and document predicates. 
Documents are autonomously clustered using a self-organizing neural 
network that provides a coordinate system that makes judgments in a 
non-subjective fashion. 

French Abstract 

L' invention concerne un procede et un systeme de classement et 
regroupement par pertinence qui determine la pertinence d'un document par 
rapport a une demande d' utilisateur au moyen d'un processus de 
comparaison par similitude. Des demandes d' entree sont analysees selon 
une ou plusieurs structures de predicat d' interrogation au moyen d'un 
analyseur ontologique . Cet analyseur ontologique analyse un ensemble de 
documents connus afin de generer une ou plusieurs structures de predicat 
de document. Une comparaison de chaque structure de predicat 
d' interrogation avec chaque structure de predicat de document est 
effectuee afin de determiner un degre de correspondance, represente par 
un nombre reel. Une strategie de modification multiniveau est mise en 
oeuvre afin d'attribuer differentes valeurs de pertinence aux differentes 
parties de chaque correspondance de structure de predicat et afin de 
calculer le degre de correspondance de la structure de predicat. La 
pertinence d'un document pour une demande d ! utilisateur est determinee 
par le calcul d'un coefficient de similitude, sur la base des structures 
de chaque paire de predicats de demande et de predicats de document. Les 
documents sont regroupes de facon autonome au moyen d'un reseau neuronal 
auto-organisateur qui produit un systeme de coordonnees qui effectue des 
jugements de facon non subjective. 
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Detailed Description 

words but on patterns of 
conceptual predicate structures. Dynamic alteration can 
be made to the content of the document sets, thus 
allowing the relevancy ranking and clustering method and 
system to begin its processing even before the search for 
potential . . . 
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English Abstract 



On-line asynchronous distributed authoring and problem solving system,, 
method, and computer program (100) for focusing attention toward 
particular authoring and problem solving topics (112-116) using a 
threaded discussion group and reward matrix (120-124). Mechanism for 
directing attention and focus of large numbers of people who are solving 
problems using a tree-based problem space, where the tree-based problem 
space may be a virtual problem space. Algorithms and procedures for 
evaluating nodes in the virtual problem space and assigning values via a 
pay-off matrix that serves to focus the attention of large numbers of 
problem solvers. Combination of threaded discussion groups with the 
pay-off matrix and a variety of algorithms to create useful system for 
solving multi-level problems leveraging human expertise. 

French Abstract 

La presente invention concerne un systeme, un procede et un programme 
informatique (100) pour creer et pour resoudre des problemes en ligne, de 
maniere asynchrone et repartie, afin d'attirer 1' attention sur des sujets 
de creation et de resolution de problemes particuliers (112-116), par 
utilisation d ! un groupe de fils de discussion et d f une matrice de 
recompense (120-124). La presente invention concerne un mecanisme 
permettant de diriger 1' attention et l'interet d'un grand nombre de 
personnes qui sont en train de resoudre un probleme, par utilisation d f un 
espace de probleme base sur un arbre, espace qui peut etre virtuel. Des 
algorithmes et des procedures permettent d ! evaluer des noeuds dans 
1' espace de probleme virtuel et d f attribuer des valeurs, par 
1 1 intermediaire d'une matrice de gain qui sert a attirer l 1 attention d'un 
grand nombre de personnes en train de resoudre un probleme. La presente 
invention concerne egalement la combinaison de groupes de fils de 
discussion avec la matrice de gain et d ! une variete d 1 algorithmes , afin 
de creer un systeme utile permettant de resoudre des problemes a 
plusieurs niveaux sous 1' influence de connaissances humaines. 
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... on the amount of instructional content desired for each test item, 
several linkers might tag content that they found most appropriate. 

The relevance value of those items tagged by several linkers could be 
increased, so that the most popular. . . 
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English Abstract 

A method for providing searching and alerting capabilities in traffic 
content at access points in data networks is disclosed. Typical access 
points for Internet, intranet and wireless traffic are described. Traffic 
flow through an Internet Service Provider is used as a preferred 
embodiment to exemplify the data traffic used as the input source in the 
invention. The invention teaches how proper privacy and content filters 
can be applied to the traffic source. The filtered data stream from the 
traffic flow can be used to improve the quality of existing searching and 
alerting services. The invention also teaches how a cache can be 
developed optimized for holding fresh searchable information captured in 
the traffic flow. It is further disclosed how the said cache can be 
converted to a searchable index and either separately or in cooperation 
with external search indexes be used as a basis for improved search 
services. The invention also discloses how the traffic flow can be 
analyzed in order to derive addes information for measuring document 
relevance, access similarity between documents, personalized ranking of 
search results, and regional differences in document accesses. 

French Abstract 

La presente invention concerne un procede qui assure des capacites de 
recherche et divertissement dans un contenu de trafic au niveau de 
points d'acces dans des reseaux de donnees. Des points d'acces 
specif iques pour Internet, intranet et le trafic sans fil sont presentes. 
Le flux de trafic a travers un fournisseur de service Internet est 
utilise comme forme de realisation preferee pour illustrer le trafic des 
donnees sur lequel se fonde la presente invention. Cette invention 
demdntre comment des filtres de conf identialite et de contenu peuvent 
etre utilises pour la source du trafic. Le flux de donnees filtrees 
provenant du flux de trafic peut etre utilise pour ameliorer la qualite 
des services de recherche et divertissement existants . Cette invention 
montre egalement comment une memoire cache peut etre mise en oeuvre de 



maniere optimisee pour conserver des informations toutes nouvelles 
pouvant etre recherchees qui ont ete extraites du flux de trafic. 
L 1 invention montre egalement comment la memoire cache peut etre 
transformee en un index pouvant etre recherche et utilise, soit 
separement soit en cooperation avec des index de recherche ext ernes, en 
tant que base pour des services de recherche ameliores. Cette invention 
demontre egalement comment le flux de trafic peut etre analyse pour 
obtenir des informations supplementaires servant a mesurer la pertinence 
de documents, la similarite d'acces entre des documents, le classement 
personnalise de resultats de recherche et les differences regionales dans 
l'acces aux documents. 
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. . . A method according to claim 1, characterized by the searching step 
including a substep for relevancy ranking depending on access 
counting of individual traffic content units through an access point. 

14 A method according to claim 1, wherein the searching... 
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English Abstract 

Method and system for determining personal characteristics of an 
individual or group and using same to provide personalized advice or 
services. The system dynamically incorporates several personality 
dimensions, life style, quality of life, cultural context, demographics, 
and psychographics, as requested by the test administrator or individual 
user, and controls and standardizes the testing protocol, and retains 
test data in such a way that individuals and non-professional users can 
reliably self administer the tests, save their test results in a system 
database, and use the results to obtain personality-based advice, 
content, and people-matching services from a system proprietor. 

French Abstract 

L ! invention concerne un procede et un systeme permettant de determiner 
les caracteristiques personnelles d ! un individu ou d ! un groupe, et sur la 
base desdites caracteristiques, de fournir des conseils ou des services 
personnalises . Le systeme selon l 1 invention incorpore dynamiquement 
plusieurs traits de la personnalite, le style de vie, la qualite de vie, 
le contexte culturel, des donnees demographiques et phychographiques , tel 
que requis par 1 1 administrateur du test ou un utilisateur individuel, 
controle et normalise le protocole de test, et conserve les donnees du 
test de maniere que les individus et les utilisateurs non prof essionnels 
puissent gerer eux-memes ces tests, de facon fiable, sauvegarder leurs 
resultats dans une base de donnees du systeme, et utiliser lesdits 
resultats pour obtenir du titulaire du systeme des conseils, des contenus 
et des services de mise en relation de personnes bases sur la 
personnalite. 
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history of the individual, and compares this with the personality, 
psychographic, behavior, and declared preference relevance values 
from all users to determine the optimum. content to display to the 
user . 

29 

Content selected in step 3220 includes, but is not... 
...3200A) in that (inverted exclamation mark)t is information that the user 
is actively seeking. Content is passively presented to the user based 
upon relevance values , history, and. 



declared preferences. 



The process begins at step 3240 where the system, must determine ... break 
down the areas of confliets and agreement for those two types. 

1 5 Like content in 3200A, all advice stored in database 3300 has 
relative relevant strength values associated with every 
classification scheme. In other words, for the Keirsey Temperament Sorter 
personality scheme. styles, psychograpf f les and preferences or other 
concepts. The feedback is then used to modify the relevance strength 
values used to make matches between users and content or advice, or 
people . 

Example 4 
37 

A user is interested in finding a compatible ... sub j ect matter, content 
(3250), application (3240) and the media format (3260) is to present the 
content . The administrator will also be able to customize the relevance 
values to alter the matching algorithins to meet his/her needs. 

Module 3000 provides for matching... 
? tlO/5, k/25-26 
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English Abstract 

A method, apparatus, and system for improving the ability to access, map, 
search, navigate, and visualize complicated bodies of related 
information, for example, bodies such as the World Wide Web. A temporal 
user interface [9901] allows users to visualize complex data sets which 
are represented as raindrops in a pool [9904] of water across the time 
span [9911] of a rain storm. The invention dynamically analyzes and maps 
[9906] information to provide and display relevant information over time 
in the temporal user interface. The invention further provides a 
mechanism allowing users to perform directed searches which allows users 
to provide a relevant starting point, subject matter, and other criteria 
to the search facility [9913] that brings more meaningful results. The 
invention also provides an interpretive help facility [9914] which can 
analyze users histories of actions to provide relevant help based on 
those actions, and also allows users to correct mistakes. 

French Abstract 

La presente invention concerne un appareil, un procede et un systeme qui 
ameliorent la capacite d'acceder, de mapper, d 1 interroger, de naviguer et 
de visualiser des entites complexes d 1 informations connexes, par exemple 
des entites telles qu' Internet. Une interface utilisateur temporaire 

(9901) permet a des utilisateurs de visualiser des ensembles de donnees 
complexes representes comme des gouttes de pluie dans une flaque d'eau 

(9904) pendant la duree (9911) d ! une pluie d'orage. Les procede et 
systeme de 1' invention analysent et mappent (9906) de facon dynamique les 
informations a fournir et presentent au fur et a mesure les informations 
pertinentes dans 1* interface utilisateur temporaire. L' invention concerne 
egalement un mecanisme qui permet a un utilisateur d'effectuer des 
interrogations directes et de fournir au dispositif d 1 interrogation 

(9913) un point de depart pertinent, un sujet, et plusieurs autres 
criteres produisant des resultats plus signif icatif s . L 1 invention 
concerne en outre un dispositif d'aide interpretive (9914) qui analyse 
l'historique des actions effectuees par 1 1 utilisateur et fournit une aide 
pertinente fondee sur ces actions. Le dispositif d'aide interpretive 
permet egalement a 1 1 utilisateur de corriger ses erreurs . 
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Detailed Description 
size of 

the document, number of related links, number of photos, 
staleness of links, media content type, subject relevancy 

ranking , or the like. These criteria may be accessed and 
modified by the user through a... 
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English Abstract 

The system and method for automatically and contextually matching (14) 
offers to the content on a web page (22) that a user has selected in 
real-time. Additionally, the system and method further refines the 
resulting matches (14) based on business rules established by the web 
site (22A) owner or the system provider, thereby providing offers that 
are likely to generate selling opportunities to the web site (22A) owner. 

French Abstract 

La presente invention concerne un systeme et un procede permettant de 
mettre en correspondance (14) de facon automatique et contextuelle des 
off res sur un contenu de page web (22) qu'un utilisateur a selectionnees 
en temps reel. Ce systeme et ce procede affinent, en outre, les mises en 
correspondance (14) qui en resultent sur la base des reglements etablis 
par le proprietaire du site web (22A) ou du fournisseur du systeme, 
presentant ainsi des offres susceptibles de susciter des occasions de 
ventes pour le proprietaire du site web (22A) . 
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... the users can enter the Relevance score. This is a percentage that 
determines the minimum relevancy score of the product to the context 
of the content . 

The final configuration for the Business Rules set is the detennination 
of the sort order. . . 
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ABSTRACT EP 1160683 A2 

Indexing and retrieving documents stored on a network, can be extremely 
difficult. To alleviate these difficulties the present invention provides 
a computer implemented method and system of categorsing documents on a 
network, by storing documents classifications in a document 
classification datastore for use with a classification system having one 
or more categories, each category having a plurality of classifications 
which are linked in a hierarchical structure (140,142,144,146), including 
the steps of obtaining the classifications for a document for a first 
category; determining a binary identifier for the document for each of 
the obtained classifications in the first category, combining the 
determined binary identifiers to produce a combined binary identifier, 
and storing the combined binary identifier in a datastore in association 
with the document. The invention further provides for a computer 
implemented method and system for searching documents stored in a 
datastore which have been classified using a classification structure 
comprised of a plurality of levels, with each level having relations with 
adjacent levels, such that each classification in the classification in 
the classification may have ancestor classifications and/or descendent 
classifications, including the steps of obtaining (124) a search criteria 
from a user including at least one classification to be searched, 
searching (130) for all documents in the datastore which have a 
classification matching either the classifications provided by the user 
in the search criteria, or a classification which is an ancestor or 
descendent of the classification provided by the user. 
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..SPECIFICATION query vector is also calculated, and the dot product of 
the query vector and each document link vector is calculated. The dot 
products relating to a particular document are summed to determine the 
relevance ranking for each document . 

US58 64 84 6 provides a method of combining search result documents, as 
provided by. . . 
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Maintaining control of documents published on a network is difficult. 
To overcome this problem, a computer implemented method of publishing 
documents on a network is provided, comprising the steps of receiving a 
submitted document from a user, receiving a primary classification for 
the submitted document from the user, determining (86) a publisher 
associated with the primary classification, and assigning (88) the 
submitted document for review to the associated publisher. Further steps 
are provided for accepting a suitability indicator from the publisher for 
the submitted document, wherein a positive suitability indicator 
indicates the submitted document is suitable for publishing on the 
network, and publishing (96) the submitted document under the primary 
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...SPECIFICATION query vector is also calculated, and the dot product of 
the query vector and each document link vector is calculated. The dot 
products relating to a particular document are summed to determine the 
relevance ranking for each document . 

US58 64 84 6 provides a method of combining search result documents, as 
provided by. . . 
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English Abstract 

The present invention is directed to an integrated implementation 
framework and resulting medium for knowledge retrieval, management, 
delivery and presentation (Figure 8) . The system includes a first server 
component that is responsible for adding and maintaining domain-specific 
semantic information and a second server component that hosts semantic 
and other knowledge for use by the first server compoment that work 
together to provide context and time-sensitive semantic information 
retrieval services to clients operating a presentation platform via a 
communication medium. Within the system, all objects or events in a given 
hierarchy are active Agents semanticaly related to each other and 
representing queries (comprised of underlying action code) that return 
data objects for presentation to the client according to a predetermined 
and customizable theme or "Skin". This system provides various means for 
the client to customize and "blend" Agents and the unerlying related 
queries to optimize the presentation of the resulting information. 

French Abstract 

La presente invention concerne une structure de mise en oeuvre et un 
moyen associe integres, destines a 1 1 extraction, a la gestion, a la 
distribution et a la presentation de connaissances (Fig. 8). Ce systeme 
comprend un premier composant de serveur concu pour accumuler et mettre a 
jour des informations semantiques specif iques a un domaine et un second 
composant de serveur concu pour heberger des donnees semantiques et 
d'autres connaissances destinees a etre utilisees par le premier 
composant de serveur. Le premier et le second serveur cooperent pour 
fournir des services d 1 extraction d 1 informations semantiques asservies au 
temps et de contexte a des clients qui exploitent une plate-forme de 
presentation par 1 1 intermediaire d ! un moyen de communication. Dans le 
systeme, les objets ou les evenements dans une hierarchie donnee sont des 
Agents actifs lies semantiquement les uns aux autres, representant des 
requetes (comprenant des codes d 1 action sous- jacents) qui permettent de 
transmettre des objets de donnees pour qu'ils soient presentes au client 
en fonction d'une <= enveloppe >= ou d'un theme predetermine et 
personnalisable . Ce systeme fournit plusieurs moyens permettant au client 
de personnaliser et de <= melanger >= les Agents et les requetes 
associees sous- jacentes pour optimiser la presentation des informations 
resultantes . 
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specified, categorization is performed by the server dynamically. 
Results are preferably sorted based on the relevance score , or the 
strength of the "belongs to category semantic link from the object to 
the category filter. 

"Favorites" Context Template. The Favorites Context Template (and its 
resulting Special... 
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English Abstract 

A system, method and medium are disclosed for crating and utilizing a 
virtual representation of a real user, for example a person or business 
entity, to facilitate consumer-to-business and business-to-business 
transactions on networks and other electronic media such as interactive 
digital television, personal digital assistants, and mobile phones. The 
system may be broken down conveniently into seven layers, with each later 
performing a group of functions within the system. A first layer assures 
the integrity and security of the consumer to business to business 
(C2B"sup"2) system of the present invention and the interface between the 
C2B"sup H 2 system and individual users. A second layer identifies and 
supports user requirements at a given time, and provides status 
information. A third layer acquires, stores and processes user attributes 
in order to form and nurture the user's virtual representation. A fourth 
layer defines and obtains from a user the data and information necessary 
to service his or her requirements and secures and filters responses from 
suppliers to meet those requirements. A fifth layer completes and stores 
commercial transactions and supports a full range of commercial 
processes. A sixth layer carries out aggregation and presentation of 
statistical data gathered anonymously from the attributes of a large 
number of users, and the dissemination of consumer demand data down the 
supply chain. A seventh layer supports location-specific services and 
responses to user requirements and preferences. Full integration of all 




seven layers into a cohesive whole results in a novel, integrated 
C2B"sup"2 system. 

French Abstract 

L' invention concerne un systeme, un procede et un support permettant de 
creer et d'utiliser une representation virtuelle d'un utilisateur reel, 
par exemple une personne ou une entreprise, afin de faciliter des 
transactions consommateur-entreprise et entreprise-entreprise sur des 
reseaux et sur d'autres supports electroniques tels que la television 
numerique interactive, des assistants numeriques personnels et des 
telephones mobiles. Le systeme peut etre commodement reparti en sept 
couches, chacune realisant ulterieurement un groupe de fonctions dans le 
systeme. Une premiere couche permet d 1 assurer I'integrite et la securite 
du systeme consommateur-entreprise-entreprise (C2B"sup"2) de l f invention 
et de 1' interface entre le systeme C2B"sup"2 et des utilisateurs 
individuels. Une deuxieme couche permet d 1 identifier et de supporter des 
besoins utilisateur a un moment donne, et d'obtenir une information 
d'etat. Une troisieme couche permet d'acquerir, de stocker et de traiter 
des attributs utilisateur de facon a former et a entretenir la 
representation virtuelle de 1 1 utilisateur . Une quatrieme couche permet de 
definir et d'obtenir d'un utilisateur les donnees et 1 1 information 
necessaires afin de repondre a ses besoins et de securiser et de filtrer 
des reponses de fournisseurs destinees a correspondre a ces besoins. Une 
cinquieme couche permet de completer et de stocker des transactions 
commerciales et de supporter un plein domaine de processus commerciaux. 
Une sixieme couche realise l'agregation et la presentation de donnees 
statistiques collectees anonymement a partir des attributs d'un grand 
nombre d' utilisateurs , ainsi que la dissemination des donnees de demande 
consommateur vers la chaine d ' approvisionnement . Une septieme couche sert 
de support a des services specif iques d 1 emplacement et a des reponses aux 
besoins et aux preferences utilisateur. L 1 integration complete de ces 
sept couches en un ensemble concerte mene a un systeme C2B"sup"2 integre. 
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. . . to a corresponding product or service record in the PSU 255 as wel] . 
as a pointer to a supplier record in SD 256. It also has a relevance 

rating for RealMe, defined by a Boolean combination of attribute 
values. For example, a particular car... 
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English Abstract 

A search engine for retrieving documents pertinent to a query indexes 
documents in accordance with hyperlinks pointing to those documents. The 
indexer traverses the hypertext database and finds hypertext information 
including the address of the document the hyperlinks point to and the 
anchor text of each hyperlink. The information is stored in an inverted 
index file, which may also be used to calculate document link vectors for 
each hyperlink pointing to a particular document. When a query is 
entered, the search engine finds all document vectors for documents 
having the query terms in their anchor text. A query vector is also 
calculated, and the dot product of the query vector and each document 
link vector is calculated. The dot products relating to a particular 
document are summed to determine the relevance ranking for each 
document . 



French Abstract 

Une machine de recherche permettant la recherche de documents relatifs a 
une consultation indexe des documents conformement a des hyperliens 
designant ces documents. L'indexeur parcourt la base de donnees 
hypertextes et localise des informations hypertextes, notamment l'adresse 
du document que les hyperliens designent et le texte d'ancrage de chaque 
hyperlien. Ces informations sont memorisees dans un fichier index 
inverse, lequel peut egalement etre utilise pour calculer des vecteurs de 
chainage documentaire pour chaque hyperlien designant un document 
particulier. Lorsqu ! une consultation est introduite, la machine de 
recherche localise tous les vecteurs documentaires relatifs aux documents 
presentant les termes de la consultation dans leur texte d'ancrage. Un 
vecteur de consultation est egalement calcule, et le produit scalaire du 
vecteur de consultation et de chaque vecteur de chainage documentaire est 
calcule. Les produits scalaires concernant un document particulier sont 
additionnes pour determiner la hierarchie de pertinence pour chaque 
document . 
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English Abstract 

...query vector is also calculated, and the dot product of the query 
vector and each document link vector is calculated. The dot products 
relating to a particular document are summed to determine the relevance 
ranking for each document . 



Detailed Description 

query where the query has at least one term, and where hyperlinks 
contain 



terms and point to corresponding documents . The method includes 
comparing the words in the query to the words in a hyperlink to obtain a 

relevance ranking for each hyperlink, and summing the relevance 
rankings 

for each hyperlink pointing to a particular document to obtain a 
summed 

relevance score for that document . 

The query may be represented by a query vector where the 
query. . . 

. .the 

hyperlinks includes calculating the dot product of the query vector with 
the I 0 document link vector for that hyperlink. Summing the 
relevance ranking 

for each hyperlink pointing to a document includes summing the dot 
products obtained using the document link vectors for a particular 
document to obtain the summed relevance score for that document . The 
summed relevance scores may then be compared to obtain a ranking of 
1 5 documents. 

The dimension for. . .the second vector indexed with Document B has only 
two dimensions. 



As described below, the document link vector file 114 A is 

used in calculating the relevance score with respect to a particular 

query. 

Instead of creating document link vector files automatically, it 
may be 

desirable to create document link vector files only upon receipt of 
a query. 

- 19 Thus, the only entries in the link vector... 5 and as calculated for 
"tutorial" is one. 



Once the query vector and all relevant document link vectors 

have been found or calculated, control passes to block 130 to calculate 

the 

relevance scores for each document. The relevance score is 

calculated by 

finding the dot product of each document link vector with the query 
vector. 

A dot product for vectors < a, b, c > and < d...to a dot product of 1. 
- 22 

At box 131, the dot products for all document link vectors 
pertaining to a particular document are summed to obtain a "vote" or 
summed score for a particular document. The summed relevance score 
for 

Document B is the sum of the dot products for each document link 
vector 

relating Document B, which equals 1 A similar calculation can be 
made by finding the dot product... 

.be transmitted 

from one computer to another over a network. 

In the example described, no hyperlinks point to Document 



A or C, so each of their relevance scores is zero, even though both 
Document A and Document C contain the words in the... use combination 
ranking may be when 

there are too few hyperlinks (such as only one link ) pointing to a 
document . In such a case, the relevance score based upon the one 
link 

may not be accurate, so a threshold can be set... 

...depend on the words 

appearing in documents themselves, or, if used in combination with 
conventional relevance ranking do not depend solely on words 
appearing in 

the documents. Instead, the relevance ranking depends on descriptions 
of 

those documents in the anchor text of hyperlinks pointing to the 
documents . 

Documents such as Document J described above will not have a high 

- 24 

summed relevance score because authors creating hypertext documents 
will 

not include hyperlinks in their documents pointing to Document J. 

The size of a document is no longer a factor in the relevance 

ranking , and therefore problems associated with document size can be 
avoided. 

The use of thesauruses may. . . 

...in the examples shown above, Sun's Java tutorial 

site will receive a high summed relevance rank even though the ten-n 
"Java tutorial" appears only once in the document. 

The ranking method based on hyperlinks pointing to a given 

document can be used to select the most popular documents in a specific 
field using the... 

Claim 

query, wherein the query comprises at least one 
terin, and wherein hyperlinks contain terms and point to corresponding 

documents , the method comprising: 
comparing the words in the query to the words in a 

hyperlink to obtain a relevance ranking for each hyperlink; and 

summing the relevance rankings for each hyperlink pointing 

to a particular document to obtain a summed relevance score for 

that 

document . 

11 The method of claim 10 wherein: 

a number of hyperlinks, each containing a particular term, 
may point to a document ; and 

the number of hyperlinks containing the particular term 

pointing to the document is indexed. .. words in the hyperlink comprises 

calculating the dot 

product of the query vector with the document link vector for that 
hyperlink. 

- 32 

20 The method of claim 19 wherein summing the 
relevance ranking for each hyperlink pointing to a document 

comprises 



f 



summing the dot products obtained using the document link vectors for 
a 

particular document to obtain the summed relevance score for that 
document . 

21 The method of claim 20 wherein the summed 

relevance scores for each document are compared to obtain a ranking 
of 

documents . 
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L10 ANSWER 1 OF 4 INFODATA COPYRIGHT 2004 FHS Potsdam on STN 
AN 2003(12) :2819 INFODATA ON: 2003-02819 (GMD-IZ) 

TI Web discovery and filtering based on textual relevance feedback learning. 
AU Lam, W. (Chinese Univ. of Hong Kong, Hong Kong, HK) ; Wang, W. (Chinese 

Univ. of Hong Kong, Hong Kong, HK) ; Yue, C. W. (Chinese Univ. of Hong 

Kong, Hong Kong, HK) 
SO Computational intelligence. An international journal. 

Cambridge, MA, US; Oxford, GB: Blackwell: (2003) V. 19 (2) p. 136-163, 10 

figs., 8 tabs., ca . 30 refs. 

ISSN: 0824-7935 
CY United States; United Kingdom 
DT Journal 
LA English 

AB. . . some favorable properties to fulfill the discovery objectives. 

Information retrieval techniques are adopted to evaluate the content-based 
relevance of each page being explored. The hyperlink 

information, in addition to the textual context, is considered in the 
relevance score evaluation of a Web page. WID allows 

users to provide three forms of the relevance feedback model, namely, the 
positive. 

L10 ANSWER 2 OF 4 INFODATA COPYRIGHT 2004 FHS Potsdam on STN 
AN 1999(6) :1307 INFODATA ON: 99-01307 (GMD-IZ) 

Call No. : UC1 112-98 
TI Collaborating in information space. 
AU Alsmeyer, D.; Owston, F. 

SO Online Information 98. 22nd International Online Information meeting. 
Proceedings . 

Editor(s): Graham, C; Kerr, J. 
Learned Information, Oxford (GB) 

Oxford, GB: Learned Information: 1998, p. 31-37 of XIX, 398 p., 6 figs., 12 
refs . 

Conference: International Online Information Meeting 22, London, GB, 
1998.12.08-1998.12.10 

Organizer (s ) : Learned Information, Oxford (GB) 

ISBN: 1-900871-31-9 
CY United Kingdom 
DT Book article; Conference 
TC (including examples) 
LA English 
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AB. . . will be enhanced with information from other sources, such as users' 
bookmark files and the results of internet searches. Basic 
relevance- ranked retrieval has been supplemented by 

building knowledge of the databases' metadata into the search tool. Users 
are encouraged by the. . . to browse through the spaces by keyword, 
classification code and other features rather than rely on simple keyword 
searching. Database records are enhanced with links 

through to the full text of journal articles, where these are available, 
or to the Library's disintermediated document delivery system. 

L10 ANSWER 3 OF 4 INFO DATA COPYRIGHT 2004 FHS Potsdam on STN 
AN 1996 (10) : 2356 INFO DATA ON: 96-02356 (GMD-IZ) 

TI Exploring the relationship between user satisfaction and relevance in 

information systems. 
AU Gluck, M . 

SO Information processing and management. An international journal. 

Oxford, GB: Pergamon Press: (1996) V. 32 (1) p. 89-104, 6 figs., 3 tabs., 

zahlr. refs. 

ISSN: 0306-4573 
CY United Kingdom 

TC Numerical data (including statistics) 
LA English 

AB. . . system. Relevance judgments of retrieved items were obtained through 
content analysis from sense-making questionnaires as s secondary 
observation technique. The content analysis provided 
relevance values on both five-category and two-category 

scales. Results indicate that relevance has strong relationships (gamma 
values from 0.74 to 0.89) with. 

L10 ANSWER 4 OF 4 RDISCLOSURE COPYRIGHT 2004 KENNETH MASON PUBL . on STN 
AN 4 64131 RDISCLOSURE 

TI A method of focusing relevancy ranking by using 

context dependent terms weights 
PA International Business Machines Corporation 
PI RD 464131 20021210 
PRAI RD2002-464131 20021120 
SO Research Disclosure, 2002 12, 464 

CODEN: RSDSBB; ISSN: 0374-4353 
DT Patent 
GIS 54742; 25570 
TX XXXXXX 

A method of focusing relevancy ranking by using 
context dependent terms weights 

Traditional full text search systems use so called TF-IDF formula for 

calculating document relevancy scores /ranking 

(see [1]). According to this method the document relevancy 

score S, related to the search term T, is calculated as S(T) = 

F(T) * logj N / n(T) 1, where. . . document B goes to the bottom. Now, 

let's assume that document A contains customer problem report, and 

document B contains a link to the patch that should be 

applied to resolve the problem reported by a customer. In a traditional 
search system the user, most. . . appear at the top of the hit-list. 
The proposed method changes the way the search system calculates document 
relevancy scores by introducing context-dependent 

weights of search terms. The method proposes calculating document scores 
for given search terms based on weights assigned to these. . . sales', 
while term 'WAS' is more salient in the latter context. 
General schema of the method of focusing relevancy ranking is 
depicted in Figure below. This diagram shows a user 

submitting a textual query to the search system. The 'Query Builder' 
module builds. . . Index' module. This hit-list of documents represents 
'draft search results' where documents are ranked using default (TF-IDF) 
relevancy ranking algorithm. The 'Focused Relevancy 
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Rankingl module recalculates relevancy ranking of 
the documents in the hit-list, based on the query 

category assigned by the 1 Query Classifier 1 module. To calculate new 
relevancy scores the 1 Focused Relevancy Ranking 1 
module uses appropriate context-dependent glossary 
that corresponds to the identified query category. 
Novelties : 

o novel method for calculating document relevancy scores, based on 
context-dependent terms weights 

o novel method of document relevancy ranking based on 
the context of the query 

o novel method of document relevancy ranking based on 
the user profile 
References : 

[1] R. Baeza-Yates, B. Ribiero-Neto, "Modern Information 
Retrieval&quot ; , Addison Wesley, 1999, 
ISBN 0-201-39829-X. 

[2] Y. Park, RJ. Byrd, B.K. Boguraev, &quot ; Automatic . 
TI A method of focusing relevancy ranking by using 
context dependent terms weights 
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" 01640042 DOCUMENT TYPE: Product 

PRODUCT NAME: Webinator (640042) 

Thunderstone Software LLC (395005) 
14837 Detroit #303 

Cleveland, OH 44107 United States 
TELEPHONE: (216) 820-2200 

RECORD TYPE: Directory 

CONTACT: Sales Department 

Webinator allows Web site administrators to create and provide a 
high-quality retrieval interface to collections of HTML documents. 
Administrator benefits include indexes multiple sites into one common 
index; detailed verification and logging of document linkages ; 
indexes/updates documents while the database is in use; allows multiple 
databases at a site; provides an SQL query interface to the database for 
maintenance and reports; allows remote sites to be copied to the local fil 
system; multiple index engines can run concurrently against a common 
database; and support provided for Adobe Acrobat PDF documents. User 
benefits include natural language; set logic; special pattern matchers 
(regular expressions, quantities and fuzzy patterns); relevance ranking 
; proximity controls; document similarity searches (Doc Surfing); link 
reference reports ; and in- context result listings. 

REVISION DATE: 20000830 
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00138803 DOCUMENT TYPE: Review 

PRODUCT NAMES: Teoma (073458) 



TITLE : Teoma Search Engine Goes Live 



AUTHOR: Wiggins, Richard W 
SOURCE: Information Today, vl9 n5 p28(l) May 2002 
ISSN: 8755-6286 

HOMEPAGE : http : //www . inf otoday . com 

RECORD TYPE: Review 

REVIEW TYPE: Product Analysis 

GRADE: Product Analysis, No Rating 

Teoma's Teoma search engine is now available on the Web, the company having 
announced that it intends to compete with Google. Ask Jeeves Web Properties 
acquired Teoma for $1.67 million in cash and stock options, and several 
executives have stayed with Teoma. Ask Jeeves 1 technology enables users to 
search against a large, manually built database of frequently asked 
questions and uses a more traditional engine. Teoma says it surpasses 
Google in returning the most relevant results because Google uses page 
ranking and basic linear hit lists, which cannot ferret out specialized but 
very relevant and trusted content. Teoma says its new service mark, 
1 Subject-Specif ic Popularity, ' provides 'Results, ' 'Refine, ' and 
'Resources' options. An expert on information architecture says Teoma's 
layered or segmented result sets can be highly advantageous to the 
searcher, since scientific research has shown that clustered search results 
can hit the mark better than basic relevance - ranked lists. More sites 
use clustered results, which could mean that there is also anecdotal 
evidence of benefits. Not all of Teoma's hit list hyperlinks are direct 
links to a corresponding page or site, but some instead are links to 
Teoma's redirector. A user who clicks a hit list is redirected immediately 
to the labeled URL. Teoma's logs probably record which hit list items have 
been clicked. Various experts provide their views on the potential efficacy 
of Teoma. 

REVISION DATE: 20020830 
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00108288 DOCUMENT TYPE: Review 

PRODUCT NAMES: Sonar Professional Windows 95 & Windows NT (016962) 

TITLE: Sonar Professional: The Best for Both Worlds 

AUTHOR: Staff 

SOURCE: Computer COUNSEL, v9 n3 p4 (2) Mar 1998 
ISSN: 1044-1794 

HOMEPAGE: http: //www. aspenpub . com 

RECORD TYPE: Review 
REVIEW TYPE: Review 
GRADE : B 

Virginia Systems 1 Sonar, Sonar Professional, and Sonar Image text retrieval 
and document imaging systems are cross-platform tools used by many 
customers that include government agencies, educational institutions, 
lawyers, researchers, teachers, large corporations, small companies, and 
individuals. Versions with different pricing are available for Windows 3.x, 
Windows 95, Windows NT, and the Macintosh. Sonar Pro for Windows 95/NT is a 
multi-user product, which permits multiple users in a network system to 
search and display the same files concurrently. Sonar Pro is very fast and 
is designed for archiving, research, and annotation of large quantities of 



text. About 12,000 pages of text can be searched each second to allow even 
complicated searches with multiple variables to be continued almost 
instantaneously. Features include Boolean, proximity, wildcard, phonetic, 
and synonym searching; relevancy ranking ; search summaries and reports; 
sticky notes that can be linked to pages ; sidetracking; basic document 
indexing; display of word associations; block/f ield/subdocument support; 
and integration with an included database system. Sonar Pro is rated very 
good overall, with good annotation and note functions. Sonar Image 3.8 adds 
document imaging to the text retrieval application, combining scanner 
control, optical character recognition (OCR), storage management, and 
high-speed text retrieval functions in one, full-functioned package. 



REVISION DATE: 20000830 



File 347:JAPIO Nov 197 6-2003/Nov (Updated 040308) 

(c) 2004 JPO & JAPIO 
File 350:Derwent WPIX 1963-2004/00,™ &UP=200419 

(c) 2004 Thomson Derwent 
File 348: EUROPEAN PATENTS 1978-2004/Mar W02 

(c) 2004 European Patent Office 
File 349:PCT FULLTEXT 1979-2002/UB=20040318, UT=20040311 

(c) 2004 WIPO/Univentio 

Set Items Description 

SI 4 AU= 1 ALPHA S A f 

1/9/1 (Item 1 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 

(c) 2004 Thomson Derwent. All rts. reserv. 

015394679 **Image available** 

WPI Acc No: 2003-456820/200343 

XRPX Acc No: N03-363315 

Relevance rank determination method for web page, involves adjusting 
content-based relevance rank, based on link structure of pages including 
link rank values from in-cbming links 

Patent Assignee: ALPHA S A (ALPH-I) 

Inventor: * ALPHA S A* 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030061214 Al 20030327 US 2001928962 A 20010813 200343 B 

Priority Applications (No Type Date) : US 2001928962 A 20010813 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030061214 Al 11 G06F-007/00 

Abstract (Basic) : US 20030061214 .Al 

NOVELTY - A content-based relevance rank is determined for each of 
the pages based on a content of each page. The relevance rank is 
adjusted based on a link structure of the pages including link rank 
values from in-coming links. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: 

(1) system for determining a relevance rank determination system; 

and 

(2) candidate pages set ranking method. 

USE - For determining relevance rank of web page in Internet and 
intranet . 

ADVANTAGE - Improved rankings for web pages are obtained based on 
the given search query. Relevance rankings are based on linguistically 
aware link analysis where link values incorporate content-based 
relevance values of associated pages as a function of the page link 
structure . 

DESCRIPTION OF DRAWING (S) - The figure shows a flow chart 
explaining the relevance rank determination process, 
pp; 11 DwgNo 3/4 
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Index generation method for database management system, involves 
presenting associative access of generated combined index of unstructured 
and structured data columns to database using query 

Patent Assignee: ALPHA S A (ALPH-I); DIXON P (DIXO-I); KAMI NAG A G (KAMI-I) 

Inventor: * ALPHA S A*; DIXON P; KAMI NAG A G 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030033275 Al 20030213 US 2001928894 A 20010813 200340 B 

Priority Applications (No Type Date): US 2001928894 A 20010813 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030033275 Al 11 G06F-007/00 

Abstract (Basic) : US 20030033275 Al 

NOVELTY - One or more unstructured data columns including image 
data, video/audio data, text and, structured data columns are 
identified from a database. A combined index of unstructured and 
structured data columns is generated, and associative access is 
presented to the database using a query including both structured and 
unstructured conditions. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) database management system; and 

(2) data table search method. 

USE - For generating index for database management system (DBMS) 
(claimed), relational database management system (RDBMS) in computer 
system connected to network, such as Internet, intranet, wide area 
network (WAN), local area network (LAN), data communication network, 
etc. 

ADVANTAGE - Data retrieval for queries having combined structured 
and unstructured conditions are improved. The amount of data fetched to 
determine relevant rows is reduced, hence the memory used for caching 
is utilized better. Response time for queries is reduced. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the database management system. 
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Response time optimization method in ranked data retrieval system, 
involves associating each candidate document to relevance score bins 
according to its matched term weight corresponding to search query term 
weights 

Patent Assignee: ALPHA S A (ALPH-I) 
Inventor: * ALPHA S A* 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030028520 Al 20030206 US 2001885356 A 20010620 200336 B 

Priority Applications (No Type Date) : US 2001885356 A 20010620 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030028520 Al 8 G06F-007/00 

Abstract (Basic) : US 20030028520 Al 

NOVELTY - Each document from an identified candidate set, is 
associated to a specific relevance score bin, based on its total 
matched term weight corresponding to the weights assigned to each term 
of a search query. Most relevant documents are retrieved based on 
association to relevance score bins having highest score, without 
retrieving other candidate documents. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) computer readable medium storing response time optimization 
program; 

(2) object retrieval system; 

(3) relevant document retrieval method; 

(4) computer readable medium storing relevant document retrieval 
program. 

USE - For optimizing response time for retrieving relevant 
documents from set of candidate documents, used in ranked data 
retrieval system of networks such as Internet, intranet, local area 
network (LAN), wide area network (WAN), etc. 

ADVANTAGE - The relevant documents are identifiable without having 
to retrieve all the candidate documents, thereby reducing the number of 
retrievals and improving response time for processing search queries. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the information retrieval and ranking system. 
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Document language determining system connected to Internet, adjusts 
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negative assumption value that indicates whether document is in candidate 
language, based on probability value 

Patent Assignee: ALPHA S A (ALPH-I) 
Inventor: * ALPHA S A* 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030009324 Al 20030109 US 2001884403 A 20010619 200332 B 

Priority Applications (No Type Date): US 2001884403 A 20010619 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030009324 Al 10 G06F-017/20 

Abstract (Basic) : US 20030009324 Al 

NOVELTY - A language analyzer (100) determines the probability 
value indicating whether a character string of a document (105) belongs 
to a candidate language. The analyzer adjusts a negative assumption 
value that indicates whether the document is in candidate language, 
based on the probability value. When the assumption value exceeds a 
threshold value, the document in candidate language is determined. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for: 

1. Document language determination method. 

2. Process for determining language of document. 

USE - For determining language of document during document 
searching on the Internet or a network. 

ADVANTAGE - The probability of false result is reduced, thus the 
language of the document is determined with high accuracy. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the document language determination system. 

language analyzer (100) 

document (105) 
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